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Chapter  1 

Sequential  information  in  the  brain 

1.1  Phenomenology 

The  inner  workings  of  the  brain  are  still  poorly  understood.  Study  it  though  we  may, 
the  deep  truths  of  its  inner  workings — the  ways  that  it  both  processes  and  stores 
information — elude  us.  Even  in  organisms  far  simpler  than  humans,  this  problem 
remains.  In  1986,  White  et  al.  [1]  revealed  the  the  entire  connectome — the  neurons 
and  their  synaptic  connections — for  the  hermaphroditic  roundworm  C.  clegans.  Yet 
with  this  map  of  all  302  neurons  and  around  7000  connections,  we  still  do  not  fully 
understand  how  the  brains  of  C.  elegans  work.  Comparing  this  to  the  estimated  86 
billion  neurons  in  the  average  adult  human  nervous  system  [2],  the  task  of  under¬ 
standing  our  own  brains  is  daunting  to  say  the  least.  As  such,  it  is  not  surprising 
that  the  body  of  literature  describing  and  modeling  the  observed  phenomena  of  the 
brain  keeps  growing.  Here,  we  develop  tools  for  use  in  the  analysis  of  neuronal  data 
that  accompanies  these  phenomena.  Though  our  tools  are  of  general  purpose,  we 
focus  on  specific  phenomena  observed  in  a  particular  area  of  the  mammalian  brain: 
the  hippocampus. 
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Figure  1.1:  Place  fields  and  neuronal  spiking.  The  animal  here  has  four  place  cells 
(not  pictured)  each  with  a  preference  for  spiking  while  the  animal  is  at  a  particular 
location  within  a  straight  track.  The  four  place  fields  are  shown  as  colored  regions 
on  the  track  (bottom).  The  traversal  of  such  a  track  results  in  the  generation  of  a 
neuronal  sequence  (top)  called  a  place-held  sequence,  which  reflects  the  direction  of 
traversal  through  the  track. 


The  hippocampus  is  widely  believed  to  aid  in  two  main  functions:  episodic  memory 
and  spatial  navigation.  Episodic  memories  are,  broadly  speaking,  memories  that  are 
associated  with  events  in  one’s  life  (times,  places,  emotions,  etc.);  this  type  of  memory 
is  in  contrast  with  other  types  of  memory  such  as  semantic  memory  (knowledge  of 
facts)  and  implicit  memory  (e.g.,  how  to  ride  a  bike).  Evidence  for  the  participation 
of  the  hippocampus  in  the  formation  of  new  episodic  memories  was  first  reported  in 
[3]  in  1957.  A  man  referred  to  as  Patient  HM  underwent  a  surgery  to  remove  large 
portions  of  his  hippocampal  formation  in  an  attempt  to  be  cured  of  severe  epileptic 
seizures.  Although  the  surgery  was  successful  in  regards  to  reducing  the  frequency 
and  severity  of  the  seizures,  there  was  an  unexpected  side  effect:  Patient  HM  no  longer 
seemed  to  be  capable  of  forming  new  memories  about  his  experiences.  Research  on 
Patient  HM  and  his  condition  continued  until  his  death  in  2008. 

The  clearest  evidence  of  the  participation  of  the  hippocampus  in  spatial  navigation 
is  the  existence  of  place  cells,  first  observed  in  1971  in  [4],  These  neurons  have  a 
preference  for  spiking  when  the  animal  is  occupying  a  certain  location  in  space,  as 
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illustrated  in  Figure  1.1.  The  spatial  area  in  which  a  place  cell  has  a  tendency  to  spike 
is  called  its  place  field.  Traversal  of  a  series  of  place  fields  results  in  the  generation  of 
a  neuronal  sequence  called  a  place-field  sequence,  such  as  pictured  in  Figure  1.1.  It  is 
believed  (e.g.,  [5],  [6],  [7])  that  generation  of  place-field  sequences  is  intimately  related 
to,  and  perhaps  dependent  upon,  a  hippocampal  oscillation  of  the  local-field  potential 
(LFP)  called  the  theta  oscillation  that  exists  whenever  an  animal  is  in  locomotion. 

A  hippocampal  event  called  a  sharp-wave  ripple  (SWR)  has  also  been  tied  to  the 
formation  of  new  episodic  memories.  Such  an  event  is  illustrated  in  Figure  1.2.  As 
pictured  in  this  figure,  a  SWR  is  characterized  by  two  main  features:  (1)  a  sharp  wave, 
which  is  a  separation  in  the  local-field  potential  (LFP)  across  the  pyramidal  layer 
of  the  hippocampus,  and  (2)  a  burst  in  neuronal  activity,  often  observed  as  a  high- 
frequency  ripple  in  the  LFP  within  the  pyramidal  layer.  When  SWRs  were  selectively 
eliminated  from  neural  activity  during  post-training  memory-consolidation  periods  in 
[8],  animals  exhibited  reduced  performance  in  spatial  memory  tasks,  indicating  that 
SWRs  are  necessary  for  the  consolidation  of  episodic  memories. 
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Figure  1.2:  A  sharp-wave  ripple  in  the  hippocampus.  On  the  left,  we  see  an  image 
of  a  rat  hippocampus  with  a  probe  across  the  pyramidal  layer  of  the  CAl  region  of 
the  hippocampus.  In  the  upper-right  section  of  this  figure,  we  see  a  real  example  of 
local-held  potentials  from  above  and  below  the  pyramidal  layer.  This  separation  of 
the  local-held  potential  is  called  a  sharp  wave.  Below  the  sharp  wave,  the  increase  in 
spiking  that  accompanies  the  SWR  is  showin  in  a  spike  raster  plot. 


1.2  Neuronal  sequences 

Given  that  neuronal  sequences  are  important  for  storage  and  processing  of  information 
in  the  brain,  it  is  crucial  that  we  possess  a  reasonable  model  of  sequences.  Similarities 
between  place-held  sequences  and  SWR  sequences  [9]  give  reason  to  believe  that 
properties  such  as  sequence  duration  and  total  number  of  spikes  (and  even  the  precise 
timing  of  spikes)  are  perhaps  less  important  (or,  rather,  manifestations  of)  some  other, 
more-intrinsic  properties  that  seem  to  be  less  immediately  obvious  when  considering 
and  comparing  sequences.  An  appropriate  sequence  model  should  ideally  capture 
these  intrinsic  qualities  and,  in  so  doing,  address  the  fundamental  question  of  what  a 
sequence  actually  is. 

With  clean  illustrations  such  as  that  given  in  Figure  1.1,  the  concept  of  sequence 
seems  simple:  A  sequence  is  intuitively,  more  or  less,  just  an  ordering  of  neurons. 
Though  other  characteristics  stand  out,  the  fact  that  there  seems  to  be  a  clear  ordering 
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Figure  1.3:  Sequences  from  rat  hippocampus.  The  SWR  sequence  in  panel  (a)  and 
the  place-field  sequence  in  panel  (b)  are  very  different  from  each  other  in  many  ways 
including  duration  and  number  of  spikes.  Still,  these  sequences  share  a  similar  trend 
in  the  general  order  in  which  neurons  seem  to  spike. 


is  a  large  part  of  what  makes  a  sequence  seem  to  be  structured  to  us.  This  intuition, 
however,  breaks  down  as  we  start  to  look  noisy,  real-world  data  such  as  that  in 
Figure  1.3.  How  do  the  two  sequences  in  this  figure  compare  to  each  other?  These 
sequences  appear  to  the  eye  to  have  similar  trends  in  their  neuronal  firing  patterns. 
How  can  we  characterize  and  quantify  this  relationship?  This  example  immediately 
reveals  that  our  intuitive  notion  of  a  sequence  is  not  broad  enough  to  encapsulate  the 
sequences  observed  in  real  data.  The  situation  becomes  even  harrier  when  we  think 
about  asking  a  computer  to  tell  us  which  sequences  are  similar. 

In  the  literature  that  deals  with  neuronal  sequences,  oversimplified  mathematical 
models  have  arisen  to  describe  the  concept  of  a  sequence.  These  models  have  served 
a  purpose  and  pushed  the  field  forward,  but  they  are  limited  in  their  representation 
of  such  a  broad  class  of  events.  Chapter  2  introduces  a  generalized  model  that  more- 
robustly  represents  these  sequences. 


1.3  Summary  of  thesis 


In  the  first  half  of  this  thesis  (Chapters  2-3),  we  introduce  a  novel  tool  for  char¬ 
acterizing  and  comparing  neuronal  sequences:  the  bias  matrix.  Chapter  2  provides 
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characterizations  of  neuronal  sequences,  including  the  introduction  of  the  bias  matrix 
(Section  2.3).  In  Chapter  3,  we  investigate  mathematical  properties  of  bias  matri¬ 
ces,  including  a  description  of  their  relationship  to  the  previously  used  center-of-mass 
representation  (Section  3.1),  a  proof  that  all  bias  matrices  lie  in  the  fundamental 
polytope  (Theorem  3.25  in  Section  3.3),  and  a  proof  that  the  adjacency  matrix  of  any 
simple  digraph  can  be  realized  as  the  sign  matrix  of  some  bias  matrix  (Theorem  3.34 
in  Section  3.4). 

The  second  part  of  this  thesis  (Chapters  4-5)  describes  how  we  use  these  tools  to 
analyze  neuronal  sequences  from  the  hippocampus  of  rats.  Chapter  4  uses  the  bias 
matrix  to  introduce  a  statistical  tool  for  detecting  non-random  correlations  among 
sequences  of  neuronal  activation.  Chapter  5  introduces  the  experiment  in  which  the 
data  was  recorded  and  describes  the  various  types  of  sequences  that  are  of  interest. 
In  Section  5.3,  the  techniques  developed  in  Chapter  4  are  used  to  investigate  the 
similarities  between  the  aforementioned  sequence  types.  Appendix  A  serves  as  a 
README  hie  for  the  code  base  that  was  developed  for  these  analyses.  Most  of  the 
code  used  for  this  analysis  was  written  from  scratch  for  this  analysis. 
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Chapter  2 


Characterizations  of  sequences 


As  discussed  in  Section  1.2,  representation  of  neuronal  sequences  as  orderings  of 
neurons  is  appealing  because  it  makes  sequences  easy  to  describe  and  easy  to  interpret; 
but  it  falls  short  in  that  (1)  it  does  not  capture  the  complex  relationships  that  exist 
between  neurons,  especially  as  observed  in  noisy  data,  and  (2)  it  has  no  simple, 
realistic  biological  interpretation.  In  this  chapter,  we  describe  a  new  representation 
of  sequences  that  addresses  these  issues  and,  as  we  will  see  in  Chapter  3,  generalizes 
the  concept  of  a  neuronal  ordering. 

2.1  A  combinatorial  characterization 

Throughout  the  remainder  of  this  document,  we  denote  the  collection  of  neurons  by 
A f  :=  [n] ,  where  n  G  N.  The  raw  form  of  a  sequence  with  which  we  start  is  called  a 
spike  train: 

Definition  2.1.  A  spike  train  is  a  set  of  pairs  of  firing  times  and  neurons,  that  is,  a 
set  of  the  form  {(t,  i)  \  t  G  M,  i  G  Af}. 
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Figure  2.1:  Spike  raster  plot.  To  get  a  sequence  from  a  spike  train  pictured  as 
a  spike-raster  plot  (as  shown  here),  we  simply  list  the  neuron  for  each  spike  in 
order  of  occurrence  (left  to  right).  Hence,  the  corresponding  sequence  is  s  = 
(1,  2, 4, 4, 4, 4,  3,  3, 1,  3, 1, 1,  2, 4)  =  12444433131124. 

We  have  already  seen  graphical  examples  of  spike  trains  in  Figures  1.2  and  1.3  in 
the  form  of  spike  raster  plots.  As  seen  in  the  real-world  sequences  introduced  in  Chap¬ 
ter  1,  neuronal  sequences  occur  on  various  timescales;  but  their  sequential  nature  is 
independent  of  the  timescales  on  which  they  occur.  As  such,  an  appropriate  sequence 
representation  will  be  timescale  invariant  while  still  maintaining  complex  relationships 
between  the  spikes  of  neurons.  Although  spike  trains  maintain  complex  relationships 
between  the  spikes  of  neurons,  this  representation  is  not  timescale-invariant  since 
each  spike  has  an  associated  time.  If  we  instead  discard  this  time  information  and 
maintain  only  the  order  in  which  spikes  occur,  the  resultant  representation  will  be 
timescale-invariant  and  also  represent  complex  inter-spike  relationships.  This  brings 
us  to  our  formal  definition  of  a  sequence. 

Definition  2.2.  A  sequence  is  a  finite  list  of  neurons. 

If  s  =  (si,  s2,  •  •  • ,  is  a  sequence,  we  will  often  write  s  as  s  =  SiS2  •  •  •  Sg.  Given 
a  spike  train  T  =  {(A,  A)> . . . ,  (A,  A)}  with  A  <  ■  ■  ■  <  ti  and  A  <  A+i  if  A  =  A+i, 
the  corresponding  sequence  is  (A,  ■  ■  • ,  A)-  An  example  of  this  process  is  illustrated  in 
Figure  2.1. 

The  set  of  all  sequences  on  the  neuron  set  A f  is  represented  by  S(N)  or,  if  A f 
is  understood,  by  S.  Table  2.1  contains  some  terminology  and  notation  for  working 
with  a  sequence  s  =  (si, . . . ,  sg)  and  a  neuron  i. 


II  I 

I  I 

I  I  II 
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Table  2.1:  The  terminology  of  sequences 


Notation 

Terminology 

Meaning 

a  spike  of  s 

an  entry  of  a  sequence  s 

i  spikes  in  s 

i  =  Sk  for  some  k  G  [fi\ 

len(s) 

length  of  s 

the  number  of  spikes  in  s 

supp(s) 

support  of  s 

collcctoin  of  neurons  that  spike  in  s 

Li( s) 

spike  set  of  i 

collection  of  indices  where  i  spikes 

Ci(s) 

spike  count  of  i 

number  of  times  that  i  spikes 

ms) 

firing  rate  of  i 

proportion  of  spikes  that  are  equal  to  i 

Example  2.3.  With  s  from  the  previous  example,  we  see  that  neuron  1  spikes  four 
times  and  has  spike  set  L i(s)  =  {1,9,11,12};  hence,  the  spike  count  of  1  in  s  is 
ci(s)  =  4.  Similarly,  C2(s)  =  2,  C3(s)  =  3,  Czt(s)  =  5,  and  q(s)  =  0  for  any  other 
neuron  i  G  A f.  Since  len(s)  =  14,  the  firing  rate  of  each  neuron  in  s  is  /q(s)  =  -jL, 
P2( s)  =  ds(s)  =  and  p4(s)  =  n- 

Definition  2.4.  Given  a  sequence  s  =  (,s, . ^  ,sq)  and  an  index  set  K  = 

{ki, . . . ,  k£>}  C  [£]  with  k\  <  ■  ■  ■  <  kf,  we  define  the  subsequence  of  s  corresponding 
to  K  to  be  s k  :=  (s^, . . . ,  Sk ,,).  We  say  that  a  sequence  s'  is  a  subsequence  of  s  if 
s'  =  Sk  for  some  index  set  K  C  [£]. 

Example  2.5.  Let  s  =  (1,  2, 4, 4, 4, 4,  3, 3, 1,  3, 1, 1, 2, 4)  be  the  sequence  from  Fig¬ 
ure  2.1,  and  let  AT  =  [6]  and  AT  =  {1,2,9,11,12,13}.  Then  the  subsequences  of 
s  corresponding  to  AT  and  AT  are  sKl  =  (1, 2, 4, 4, 4, 4)  and  sK2  =  (1,  2, 1, 1, 1,  2), 
respectively. 

The  following  operations  will  be  useful  in  our  discussion  of  sequences. 

Definition  2.6.  Given  any  two  sequences  u  =  (rq, . . . ,  tq)  and  v  =  (iq, . . . ,  vm),  de¬ 
fine  the  concatenation  of  u  with  v  to  be  the  sequence  u-v  =  uv  =  (Tq, . . . ,  iq, ... ,  vm). 

Notation  2.7.  For  any  neuron  i  G  J\f  and  any  positive  integer  m  G  N,  let  im  denote 
the  sequence  (si, . . . ,  sm)  with  Sk  =  i  for  all  k  G  [m\. 
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2.2  Center-of-mass  sequences 

A  sequence  can  induce  an  ordering  by  many  means.  One  sandard  way  of  performing 
this  reduction  is  by  considering  the  average  location  in  which  a  neuron  fires. 

Definition  2.8.  For  each  neuron  i  G  J\f  and  sequence  s  =  (si, . . . ,  sg),  we  define  the 
(unnormalized)  center  of  mass  of  neuron  i  in  sequence  s  to  be 


center*  (s) 


FIs)  EfceLi( s)  k>  c*(s)  >  0; 


len(s)+l 


c*(s)  =  0. 


Note  that  centers)  G  [1,  len(s)]  for  all  i  G  A f.  Moreover,  when  a  neuron  does  not 
fire  in  a  sequence,  the  corresponding  center-of-mass  value  is  chosen  so  that  it  falls  in 
the  middle  of  the  range  [1,£],  Thus,  center*  (s)  reduces  all  spikes  from  neuron  i  in 
sequence  s  to  a  single  value;  but  this  value  is  dependent  on  the  length  of  s.  To  allow 
for  comparison  of  sequences  of  different  lengths,  we  shift  and  normalize  center *(s)  so 
that  the  resultant  value  lies  in  the  interval  (—1, 1)  regardless  of  the  length  of  s: 

Definition  2.9.  For  each  neuron  IgA/"  and  sequence  s  =  (si, . . . ,  sg),  we  define  the 
(normalized)  center  of  mass  of  neuron  i  in  sequence  s  to  be 


7*(s) 


2  center*  (s)  —  1 
len(s) 


Note  that  with  this  definition,  neurons  that  do  not  fire  in  the  sequence  have 
7i(s)  =  0.  If  the  sequence  consists  of  just  one  spike,  then  the  corresponding  neuron 
has  center* (s)  =  1  and  hence  7*(s)  =  0.  On  the  other  hand,  if  a  neuron  fires  only  once 
in  the  sequence  and  this  corresponds  to  the  first  or  last  spike,  then  y*(s)  =  —  1  + 
or  7*(s)  =  1  —  p^is),  respectively,  so  that  the  endpoints  of  the  [—1, 1]  range  are  only 
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attained  in  the  limit  as  lcn(s)  — »  oo.  These  centers  of  mass  can  now  be  used  to  induce 
an  ordering  on  the  underlying  neurons. 

Definition  2.10.  We  define  the  center-of-mass  ordering  of  s  to  be  <r(s)  :=  (i1; . . . ,  in) 
where  {4,  •••,4}  =  [n]  are  such  that  7u(s)  <  •••  <  7i„(s)  and,  in  the  case  that 
77  (s)  =  77+i  (s)>  sucl1  tliat  4  <  4+i- 

Example  2.11.  Let  s  =  12444433131124.  Then,  as  we  saw  in  Example  2.3,  Li( s)  = 
{1,9,11,12}  and  ci(s)  =  4.  Now,  centeri(s)  =  ^  and  7i(s)  = 

2(M/i)-i  —  1  =  A.  Similarly,  we  find  that  center (s)  =  (A.  A.  f,f)  and  y(s)  = 
(^,°,  Further,  74(3)  <  72(s)  <  71  (s)  <  73 (s)  and,  hence,  a(s)  =  (4,2, 1,3). 

It  is  worth  mentioning  that  when  the  concept  of  centers  of  mass  is  used  in  the 
literature,  the  starting  point  is  usually  spike  trains,  not  our  concept  of  a  neuronal 
sequence.  When  starting  with  spike  trains,  the  spike  times  of  a  neuron  are  aver¬ 
aged  instead  of  its  indices  in  the  sequence;  these  average  firing  times  then  induce  an 
ordering  in  the  same  way  that  the  centers  of  mass  do. 

2.3  Pairwise  biases 

If  each  neuron  active  in  a  sequence  spikes  only  once,  then  the  sequence  itself  is  an 
ordering  of  its  active  neurons.  In  such  a  case,  the  center-of-mass  ordering  equals  the 
original  sequence.  A  broader  class  of  sequences  induce  an  ordering  in  a  similarly  un¬ 
ambiguous  way.  For  instance,  in  the  sequence  3311222,  every  spike  of  neuron  3  comes 
before  every  spike  of  neuron  1;  further,  every  spike  of  neuron  1  occurs  before  every 
spike  of  neuron  2.  Hence,  3311222  induces  the  unambiguous  order  312.  Although 
this  ordering  can  be  computed  with  centers  of  mass,  the  ordering  is  also  immediately 
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obvious  upon  inspection  of  the  sequence  since  all  of  the  spikes  from  each  neuron  are 
grouped  together.  We  give  a  name  to  sequences  such  as  this. 

Definition  2.12.  Disjoint  sets  I,  J  C  N  are  interlaced  if  there  exist  i,i'  G  /  and 
j,f  G  J  such  that  i  <  j  and  %'  >  f. 

Definition  2.13.  A  sequence  s  is  pure  if  the  spike  sets  of  the  active  neurons  are  not 
interlaced. 

In  a  pure  sequence,  all  of  the  spikes  from  one  neuron  must  follow  (or  precede)  all 
of  the  spikes  from  any  other  neuron.  In  particular,  the  indices  of  the  spikes  of  each 
individual  neuron  form  a  contiguous  set  of  natural  numbers.  To  get  a  sequence  that 
is  not  pure  from  one  that  is  pure,  we  could  transpose  adjacent  spikes  from  different 
neurons.  For  example,  the  pure  sequence  111122223333  can  be  transformed  into  the 
impure  sequence  111212223333  by  transposing  the  fourth  and  fifth  spikes  in  the  pure 
sequence.  In  a  sense,  this  new  sequence  is  minimally  impure:  The  spikes  from  neuron 
3  are  still  contiguous,  and  only  one  pair  of  spikes  from  neurons  1  and  2  are  “out 
of  order.”  This  perspective  suggests  that  the  purity  of  a  sequence  (1)  is  somehow 
related  to  the  pairwise  relationships  between  the  spikes  of  different  neurons  and  (2) 
is  dependent  on  the  collection  of  all  such  pairs  of  neurons.  In  other  words,  the  purity 
of  a  sequence  depends  on  the  bias  of  each  pair  of  neurons  to  fire  in  a  particular  order. 
We  begin  to  quantify  these  biases  with  the  bias-count  matrix  below. 

Definition  2.14.  The  bias-count  matrix  of  the  sequence  s  is  the  matrix  C(s)  := 
[cu(s)]jjeAt  where  Cy(s)  :=  |T.q(s)|  andL^(s)  :=  {(k,k')  |  k  <  k',  sk  =  i,  and  sk>  =  j} 
In  words,  Cy( s)  is  the  number  of  length- 2  subsequences  of  s  that  are  equal  to 

Example  2.15.  Consider  the  sequence  s  =  (1,  2, 4, 4, 4, 4,  3, 3, 1,  3, 1, 1,  2, 4)  from  Ex¬ 
ample  fig: running-example.  Note  that  Li(s)  =  {1,9,11,12}  and  /^(s)  =  {2,13}. 
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Hence,  we  see  that 

L12(s)  =  {(1,  2),  (1, 13),  (9, 13),  (11, 13),  (12, 13)} 

and 

L21(s)  =  {(2,9),(2,11),(2,12)}; 

so  ci2(s)  =  |Li2(s)|  =  5  and  c2i(s)  =  |L2i(s)|  =  3.  In  a  similar  way  for  the  other 
entries  Cy( s),  we  find  that  the  bias-connt  matrix  for  s  is 

5  4  8" 

1  3  6 

3  3  3' 

4  12  10 

A  few  simple  facts  about  the  bias-count  matrix  are  immediate.  First,  since  the 
definition  of  the  bias-count  matrix  arose  out  of  a  discussion  of  pure  sequences,  it 
is  worth  noting  that  the  bias-count  matrix  completely  captures  whether  or  not  a 
sequence  is  pure. 

Lemma  2.16.  A  sequence  s  with  supp(s)  =  A f  is  pure  if  and  only  ifC( s)  has  exactly 
(^)  non-zero,  off-diagonal  entries. 

Proof.  Any  C( s)  will  have  at  least  (^)  non-zero,  off-diagonal  entries.  If  C(s)  has 
more,  then,  by  the  pigeonhole  principle,  there  exist  distinct  neurons  i,j  G  A f  such 
that  Cij  >  0  and  >  0.  So  we  must  have  that  the  spikes  of  i  and  j  are  interlaced, 
meaning  that  s  is  not  pure.  □ 

Next,  one  cannot  always  recover  a  sequence  from  its  bias-count  matrix.  Consider, 
for  example,  si  =  1221  and  s2  =  2112.  Then  C(si)  =  C(s2)  =  [l  ?].  Hence,  thought 
of  as  a  map,  C  :  S  — >■  is  not  injective.  Though  we  cannot  recover  s  from 
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C(s),  we  can  recover  the  spike  counts  c*(s).  To  see  this,  first  note  that 


Since  /  :  N  — *  N  by  fin)  =  ("')  is  injective,  it  has  an  inverse;  therefore, 

1  i  _ 

Ci(s)  =  /_1(cii(  s))  =  -  +  -V1  +  8cii(s). 


Last,  note  that  ctj  +  =  CiCj.  In  words,  ct]  +  is  equal  to  the  product  of  the 

number  of  spikes  of  i  and  of  j.  This  is  easy  to  see  because  we  just  choose  one  index 
for  i  and  one  index  for  j  (which  can  be  done  in  CjCj  ways);  in  each  such  choice,  either 
i  follows  j  ( Cji )  or  j  follows  i  (c-j,). 

Though  C (s)  encapsulates  the  idea  of  purity,  it  does  so  in  a  somewhat  unsatisfac¬ 
tory  way.  The  sequences  123  and  112233,  for  instance,  are  both  pure  sequences  with 
the  same  neuronal  ordering,  yet  their  bias-count  matrices 


(7(123)  = 

'o  i  r 

0  0  1 

and  (7(112233)  = 

O 

4^  4^ 

_ l 

o 

o 

o 

lo  0  ll 

are  quite  different.  The  issue  is  that  112233  has  more  pairs  of  spikes  than  123  does. 
We  can  correct  for  this  by  normalizing  each  entry  by  an  appropriate  value.  How  likely 
is  neuron  j  to  follow  neuron  i?  There  are  Cij  index  pairs  in  which  j  follows  i  out  of  a 
total  of  CiCj  possible  index  pairs.  This  leads  us  to  the  definition  of  a  second  type  of 
bias  matrix. 

Definition  2.17.  The  probability-bias  matrix  of  a  sequence  s  is  the  matrix  £>prob(s)  :  = 
Ms)kleAt  where  %( s)  := 

Example  2.18.  The  probability-bias  matrix  for  the  sequence  s  =  12444433131124 
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from  Figure  2.1  is 


-Sprob(s) 


f6/l6 

00 

lie''' 

4/l2 

^00 

to 

o 

00 

1 

00 

V3 

1 - 

10 

cq'''' 

00 

V4 

3/6 

6/l0 

3/8 

V4 

V2 

3/5 

8/l2 

3/6 

3/9 

3/ 15 

2/3 

V2 

y3 

V5 

to 

o 

4/lO 

12/l5 

10/25 

3/5 

2/5 

4/5 

2/5_ 

The  probability-bias  matrix  maps  the  set  S  of  sequences  into  the  matrices 
[0, 1]IA/’IXIA/1,  and  it  does  so  in  a  way  that  each  off-diagonal  entry  of  the  matrix  has  a 


natural  interpretation  in  terms  of  how  likely  it  is  to  see  a  pair  of  spikes  in  a  given 


order.  As  such,  it  should  hold  that  b^  +  bji  =  1.  Recalling  that  cl}  +  c ^  =  qcj,  we 
see  that  this  indeed  holds: 


_|_  J:)  _  Cij  _|_  Cji  _  Cij  +  Cji  _  CiCj  _  ^ 

CiCj  CjCi  CiCj  CiCj 

Stated  simply,  when  i  and  j  both  spike  in  a  sequence,  each  pair  of  spikes  must  occur 
in  one  order  or  the  other.  As  defined,  this  likelihood  is  global  in  terms  of  the  sequence, 
which  gives  it  a  sort  of  skew-symmetric  nature:  b^  =  1  —  bji.  As  such,  it  is  clear  that 
not  every  point  in  the  hypercube  [0,  l]^x^  can  be  obtained  as  a  probability-bias 
matrix.  Though  that  much  is  known  simply  because  no  irrational  coordinate  can 
ever  be  hit  by  the  map,  the  above  relation  shows  that  at  least  one  further  relation  is 
imposed  on  this  image.  In  a  sense,  this  relation  suggests  that  the  bias  matrix  is  still 
not  quite  in  the  right  form:  Since  + bjt  =  1,  we  see  that  bij  —  \  =  \~~-bji  =  —{bp/—  |). 
Because  the  relation  comes  from  the  fact  that  the  probability  biases  are  probabilities, 
the  entries  are  centered  at  whereas  we  might  more  naturally  think  of  a  non-bias 
(i.e.,  a  maximally  unbiased  bias)  as  being  equal  to  0.  Similarly,  we  might  want  the 
sign  of  a  bias  to  convey  the  tendency  of  neurons  to  spike  in  the  given  order  (a  positive 
bias)  or  the  opposite  order  (a  negative  bias).  Perhaps  we  then  care  more  about  the 
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difference  between  bij  and  bji  than  we  do  about  each  of  them  individually.  This  leads 
us  to  define  our  final  version  of  the  bias  matrix. 

Definition  2.19.  The  skew-bias  matrix  of  the  sequence  s  is  the  matrix  Bs kew(s)  := 
where  fij(s)  :=  btj( s)  -  Ms)  = 

Note  that  when  i  /  j ,  /// s)  =  b^  —  bji  =  bVJ  —  (1  —  bVJ )  =  2bt]  —  1.  The  skew  biases 
are  now  centered  at  zero  and  have  a  sign  that  indicates  the  direction  of  the  bias. 
Further,  the  skew-bias  matrix  is  skew-symmetric  as  a  matrix  since  =  b^  —  bji  = 

~(bji  —  b^)  =  —(3ji- 

Example  2.20.  The  skew-bias  matrix  for  the  sequence  s  =  (1,  2, 4, 4, 4, 4,  3,  3, 1,  3, 1, 1,  2, 4) 
from  Figure  2.1  is 

V4  -1/3  -1/5 

0  0  -Vs 

0  0  —3/5  ' 

Vs  3/s  0 

Here,  it  is  obvious  at  a  glance  that,  for  instance,  neuron  2  tends  to  follow  neuron  1 
and  that  neurons  2  and  3  are  maximally  unbiased  toward  each  other. 

2.4  Higher-order  biases 

When  creating  a  mathematical  model,  it  is  important  to  question  what  information 
the  model  discards.  The  pairwise  biases  from  the  previous  section  attempt  to  capture 
the  tendency  of  pairs  of  neurons  to  fire  in  a  particular  order.  While  this  certainly 
preserves  some  information  about  complex  relationships  between  neurons/spikes,  it 
may  at  times  discard  more  information  than  is  desired. 

Example  2.21.  Consider  the  sequences  S!  =  123321  and  s2  =  321123.  These  se- 


-®skew(s) 


0 

-1/4 

V3 

V5 
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quences  are  indistinguishable  by  pairwise  biases: 

'1  2  2' 

C(Sl)=C(s2)=  2  12. 

.2  2  1_ 

Note  that,  since  the  probability-  and  skew-bias  matrices  are  computed  from  the  bias- 
count  matrix,  these  matrices  also  cannot  distinguish  between  S]  and  s2. 

We  might  want,  however,  for  the  sequences  from  the  above  example  to  be  dis¬ 
tinguishable  using  some  bias-like  method.  This  may  or  may  not  be  possible  for  all 
pairs  of  sequences,  but  we  can  extend  the  idea  behind  the  pairwise-bias  matrices  to 
consider  more  neurons  within  each  individual  bias.  For  instance,  we  can  count  the 
subsequences  that  are  equal  to  each  permutation  of  123  (where,  before,  we  counted 
the  subsequences  equal  to  each  permutation  of  12).  We  call  these  higher-order  biases. 

Definition  2.22.  For  a  list  v  =  (iq, . . .  ,i>k)  of  k  distinct  neurons,  we  define 
Lv( s)  :=  {(ii,  •  •  • ,  4)  Mi  <  •  •  •  <  ik  and  stj  =  v3  for  all  j  G  [k]}. 
to  be  the  collection  of  subsequences  of  s  that  are  equal  to  v  and  define 

Cv(s)  •  |hv(s)| 

to  be  the  count  of  those  subsequences;  we  call  cv(s)  an  order-k  bias  count.  Moreover, 
analogously  to  the  bias-count  matrix,  we  define  the  order-k  bias-count  vector 

■  cvi(s)  ■ 

Cfc(s)  :=  : 

.c'v(8). 

where  vi,...,Vfc/  is  the  list  of  length- k  biases  in  lexicographical  order  and  k'  = 
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(W\h,\—  AT  _  I  \ '| 

{  k  )K-~  (.A'  -A)!  “  lyV  l(fc)’ 

Example  2.23.  Continuing  the  previous  example,  let  Si  =  123321  and  S2  =  321123. 
Note  that  L23i(si)  =  {(2,3,6),  (2,4,6)}  and  L2 31 (s2)  =  0.  In  particular,  c23i(si)  =  2 
but  c23i(s2)  =  0.  Hence,  the  order-3  biases  can  distinguish  between  the  sequences  Si 
and  s2. 

Note  that  cv  is  a  generalization  of  and  that  c,j  =  cuj)  =  cv  for  v  =  (1,  2).  This 
also  unihes  the  definitions  of  the  spike-count  and  the  bias-count  ctf,  that  is,  c%  and 
Cij  are  order-1  and  order-2  biases,  respectively.  But  this  added  information  comes  at 
a  cost:  The  number  of  order-A;  biases  is  0(\J\f\k).  With  this  increase  in  the  number  of 
biases,  it  is  valuable  to  see  that  a  higher-order  bias  vector  contains  at  least  as  much 
information  about  a  sequence  as  a  lower-order  conterpart. 

Lemma  2.24.  The  collection  of  order-{k  —  1)  biases  can  be  computed  from  the  col¬ 
lection  of  order-k  biases  if  the  spike  counts  are  also  known. 

Proof.  Let  s  €  S  be  given,  and  let  v  =  (iq, . . .  ,  tq_i)  E  J\fk~l  be  a  list  of  distinct 
neurons.  We  want  to  compute  the  order-(/c  — 1)  bias  cv(s)  from  the  collection  of  order- 
k  biases.  Recall  that  cv(s)  is  the  number  of  times  that  v  appears  as  a  subsequence  of 
s.  Let  U  be  the  collection  of  length- A:  sequences  of  distinct  neurons  and  of  which  v 
is  a  subsequence.  To  make  this  more  precise,  define  the  sets  V  =  {tq, . . . ,  v^-i}  and 
/  =  A f\V.  We  can  now  write  U  as 

U  =  { v  [mj  •  v  ■  V{m+i,...,k}  |  v  e  N,  me  [A;]}  . 

We  want  to  double-count  the  number  d  of  subsequences  of  s  that  are  equal  to  some 
element  of  U .  First,  by  definition  of  d  and  the  bias  count  cu(s),  we  have  that  d  = 
Sue(7  W(s). 
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To  count  in  another  way,  let  B  =  {JveN  Lv(s)  be  the  collection  of  indices  of  s 
that  are  not  spikes  from  the  set  V.  Now,  each  subsequence  of  s  that  is  equal  to  an 
element  of  U  has  the  form  Sju{mj  for  some  m  £  B  and  some  J  C  [£]  with  s j  =  v; 
moreover,  each  such  J  and  m  yields  such  a  subsequence.  As  there  are  cv(s)  such  J 
and  ^2veN  c„(s)  such  m,  we  find  that  d  =  cv(s)  cu(s)-  Finally,  putting  together 

the  two  expressions  for  d ,  we  find  that 


This  completes  the  proof.  □ 

Similarly  to  the  pairwise  biases,  we  may  not  care  about  subsequence  counts  cv 
so  much  as  the  likelihood  that  the  neurons  iq, . . . ,  vk  will  appear  in  the  order  v  = 
(iq, . . . ,  Vk )  provided  that  they  all  appear. 

Definition  2.25.  Given  a  list  v  =  (iq, . . . ,  vk)  of  distinct  neurons,  define  the  order-k 
probability-bias  to  be  the  number  bv(s )  :=  (s).  The  order-k  probability-bias 

vector  is  the  vector 

'  ^vds)  ' 

Bk{ s)  :=  : 

where  Vi, . . . ,  vk>  is  the  list  of  length- k  biases  in  lexicographical  order. 

Note  that  the  denominator  of  6V( s)  is  the  total  number  of  length- A;  subsequences 
of  s  that  contain  all  neurons  V\ , ,vk',  together  with  the  fact  that  cv(s)  is  a  count  of 
such  sequences,  we  see  that  bv(s)  is  really  the  proportion  of  all  length- A;  subsequences 
of  s  containing  v1} . . . ,  vk  that  are  equal  to  v. 

Example  2.26.  Let  s  =  12444433131124.  Then  Li(s)  =  {1,9,11,12},  L2{ s)  = 
{2, 13},  and  L3(s)  =  {7,  8, 10}.  So  we  see  that  Li23(s)  =  {(1,  2,  7),  (1,  2,  8),  (1,  2, 10)} 
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and  L132(s)  =  {(1,  7, 13),  (1,  8, 13),  (1, 10, 13),  (9, 10, 13)};  hence, 

ci23(s)  —  |-^i23(s)|  —  3  and  Ci32(s)  =  |L132(s)|  =  4. 


Since  ci(s)  =  4,  c2(s)  =  2,  and  c3(s)  =  3,  we  also  have  that 


bl23(s)  — 


Cl23(s) 


Ci(s)c2(s)c3(s)  8 


1  ,  ,  /  x  Ci23(s) 

and  6123(s)  = 


ci(s)c2(s)c3(s)  6' 


Similarly,  we  can  compute  the  remainder  of  the  order-3  bias-count  and  probability- 
bias  vectors.  The  vectors  are  shown  as  matrices  here  strictly  for  formatting  purposes: 


Cl23(s) 

^•213  (®) 

C312 (s) 

C412 (s) 

"3 

1 

8 

12" 

cm(s) 

C214(s) 

C314(s) 

Ql3(s) 

9 

3 

8 

4 

Cl32 (s) 

C23l(s) 

C321 (s) 

C421 (s) 

4 

8 

0 

0 

Cl34(s) 

^234 (s) 

C324 (s) 

C423 (3) 

4 

3 

3 

0 

Cl42 (s) 

C24l(s) 

C341 (s) 

C431 (s) 

4 

12 

0 

32 

_Cl43(s) 

C243(s) 

C342 (s) 

C432(s)_ 

12 

12 

0 

12 

^123 (®) 

6213(3) 

6312(3) 

6412(3) 

V8 

y  24 

V3 

3/l0 

&124(s) 

6214 (s) 

6314(3) 

6413(3) 

9/40 

3/  40 

2/l5 

yis 

6l32(s) 

6231 (s) 

6321 (s) 

6421 (s) 

V6 

V3 

0 

0 

6l34(s) 

6234 (s) 

6324(3) 

6423 (s) 

Vl5 

yio 

yio 

0 

6l42 (3) 

6241 (s) 

6341 (s) 

6431(3) 

V10 

3/l0 

0 

8/l5 

_6l43 (s) 

6243(3) 

6342(3) 

6432 (s)_ 

_y5 

2/5 

0 

2/5  _ 

We  define  no  higher-order  version  of  the  skew-bias  matrix  Bs^ew  because  the  skew- 
nature  of  -Bskew  arose  from  the  fact  that  only  two  probabilities  were  involved  for  each 
pair  of  neurons,  that  is,  from  the  fact  that  btj  +  bJt  =  1.  In  the  higher-order  version, 


a  more  complicated  relationship  exists,  namely 


<jGPerms({m,...,n/c}) 


for  each  set  of  distinct  neurons  {ni, . . . ,  rik}  Q  M ■ 
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Chapter  3 


Bias  Matrices 


In  the  previous  chapter,  we  formalized  the  concept  of  a  neuronal  sequence  and  intro¬ 
duced  the  pairwise  biases  associated  with  those  sequences.  These  pairwise  biases — of 
which  we  have  three  separate  formulations — capture  the  tendency  of  pairs  of  neurons 
to  fire  in  a  particular  order.  As  the  main  drive  of  the  discussion  in  the  previous 
chapter  was  to  motivate  and  define  these  biases,  it  is  worth  taking  more  space  here  to 
investigate  further  properties  of  these  matrices.  Recall  that  we  defined  the  bias-count 
matrix  C(s)  =  [cij\ij£j^,  the  probability-bias  matrix  Rprob(s)  =  an(l  the 

skew-bias  matrix  Rskew(s)  =  AT- 

Lemma  3.1.  Let  s  and  s'  be  sequences,  and  consider  the  following  statements: 

1.  C(s)  =  C(s') 

2.  Bproh(^ s)  Bprob(s  ) 

3-  Bskew(s'j  Bskew[s  ) 

Then  statements  (1)  and  (2)  are  equivalent.  Further,  (1)  and  (2)  imply  (3). 
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Proof.  First,  note  that  that  (1)  implies  (2)  since  -Bprob  was  defined  solely  in  terms  of 
C.  Similarly,  (2)  implies  (3)  since  -Bskew  was  defined  solely  in  terms  of  Hprob.  All  that 
remains  to  be  shown  is  that  (2)  implies  (1).  To  see  this,  note  that  the  spike  count 
Cj(s)  can  be  recovered  from  the  diagonal  entry  ba( s)  of  Bpro b(s): 


bu{  s) 


Cjj(s) 

q(s)q(s) 


(Ci(s))  _  1  /  1  \  1 

q(s)q(s)  2  V  Ci(s ))  ^  Cl  S  1  —  26jj(s) ' 


Knowing  each  spike  count  now  allows  us  to  recover  any  entry  of  (7(s): 


Ms)  =  :  7^77-T  ==►  Cii(s)  =  6<i(s)ci(s)ci(s). 

CiV*)Cj 

Hence,  C(s)  =  C( s')  if  Hprob(s)  =  Hprob(s').  Therefore,  (2)  implies  (1),  as  claimed. 
This  completes  the  proof.  □ 

Note  that  statement  (3)  from  the  above  lemma  is  not  equivalent  to  statements  (1) 
and  (2).  This  is  easily  seen,  for  example,  with  the  pure  sequences  12  and  1122: 


-£>skew(12)  —  Hskew(1122)  — 


but 


(7(12) 


0  1 
0  0 


7^ 


1  4 
0  1 


(7(1122). 


The  reason  for  this  is  that  Hskew(s)  does  not  contain  sufficient  information  to  recover 
the  spike  counts  since  ^(s)  =  0  for  all  neurons  i  6  Af  and  all  sequences  s. 
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3.1  Relationship  to  center-of-mass  vectors 

Bias  matrices  were  introduced  in  an  attempt  to  deal  with  issues  associated  with 
center-of-mass  vectors,  but  these  two  concepts  were  reached  through  very  different 
approaches.  Still,  a  surprising  relationship  exists  between  them:  The  center-of-mass 
vector  center (s)  can  be  computed  from  the  bias-count  matrix  C(s). 

Lemma  3.2.  If  s  G  S  is  a  sequence  and  i  G  supp(s)  is  an  active  neuron,  then 
center j(s)  =  1  +  ^  E?=i  ci*(s)- 

Proof.  Let  I  =  len(s).  Also,  let  Cj*\ s)  denote  the  number  of  times  neuron  j  spikes 
before  the  k-th  spike  in  the  sequence  s  provided  that  sk  =  i : 

c-f(s)  :=  \{k'  <  k  |  sk>  =  j  and  sk  =  i}|. 


Observe  that 


^2cji\s)  =  Ms)  and 
k= 1 


{k  -  1,  Sfc  =  i\ 
0,  sk  ±  i. 


Further,  since  we  know  C(s),  we  know  Ca( s)  for  each  i  G  A f.  Recalling  that  Qj(s)  = 
(Ci2S')),  we  note  that  c,(s)  =  \  +  |a/1  +  8cj;(s).  Moreover,  knowing  Qj(s)  for  each 
i  G  J\f  allows  us  to  find  I  as  I  =  EieV  c*(s)-  Putting  these  facts  together,  we  see  that 

n  n  i 

j= 1  j=l  fc=l 

£  n 

-EE* 

fc=i  i=i 
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=  E  (*=-!) 

fceLi(s) 

=  ^2Li{s)  -  Ci(s) 

=  Cj(s)  [center  j(s)  —  1], 

Because  q(s)  ^  0  by  assumption,  we  can  solve  for  center j(s)  to  obtain  the  desired 
result.  □ 

In  particular,  the  above  lemma  says  that  center(si)  =  center (S2)  if  C(si)  =  C(s2). 
However,  the  converse  to  this  statement  is  not  true.  That  is,  the  center-of-mass  vector 
center(s)  does  not  contain  enough  information  to  compute  the  bias-count  matrix.  For 
example,  consider  the  sequences  Sx  =  123123  and  s2  =  211332.  Then 

'YT 

center(si)  =  center(s2)  =  7/  2 

_9/2. 

but 

'1  3  31  [1  2  4" 

C  (si)  =113^212=  C(  s2). 

.1  1  1J  Lo  2  1. 

As  such,  the  bias-count  matrix  C(s)  is  in  fact  a  generalization  of  the  center-of-mass 
vector  center (s). 

One  might  also  suspect  that  the  center-of-mass  vector  q(s)  can  be  recovered 
from  the  skew-bias  matrix  Hskew(s).  However,  upon  closer  inspection  of  the  previ¬ 
ous  lemma,  we  notice  that  the  bias-count  matrix  was  used  to  recover  the  spike-count 
vector,  which  cannot  be  done  when  starting  with  the  skew-bias  matrix.  As  the  next 
proposition  shows,  however,  we  can  reconstruct  the  center-of-mass  vector  q(s)  from 
the  skew-bias  _Bskew(s)  if  we  also  know  the  sequence’s  firing-rate  vector  p( s). 
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Proposition  3.3.  If  s  e  S  is  a  sequence,  then 

7(s)  =  Bskew(s)Tp(s). 

Proof.  Let  I  =  len(s),  and  assume  that  i  G  supp(s).  Recall  that  7*(s)  =  2ccnte^(s)~1  _ 
1.  from  which  we  see  that 


center*  (s)  =  -[7*(s)  +  1]  +  -. 


We  now  want  to  rewrite  the  previous  lemma  to  give  us  an  expression  for  7*(s)  in 
terms  of  the  skew-biases  /3*j( s).  Dropping  the  s  dependence  from  the  notation,  the 
above  equation  and  the  previous  lemma  yield  the  following: 


-(7*  +  1)  +  -  =  center* 


']i 


3= 1 


- 1  +  -  (C**  +  y 


=  i  +  -(c^pP+T,cfL((>»+v 


c*  V  2 
1 


j¥=i 


—  i+o(Q-i+y^  cj{Pji + 1) 


j¥=i 


1  +  V  ( [c*  +  ^  Cj]  -  1  +  Y  ciPp 


3+i 


3  ¥=*■ 


1+2V  1  +  ^  C3^P)  > 

3=1 


where  we  have  recalled  in  the  last  line  that  /?**  =  0  and  Xq=i  cj  =  Solving  for  7* 
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and  including  the  s  dependence  into  our  notation,  we  obtain 

n 

7 'i(s)  = 

3= 1 

since  Pi(s)  =  Note  that  although  we  derived  this  equation  using  the  assumption 
that  Cj(s)  >  0,  it  still  holds  if  q(s)  =  0  because  in  this  case  we  have  7i(s)  =  0  and 
/3ij( s)  =  0  for  all  j.  Since  ^”=1  /3ji(s)pj(s)  is  the  entry  in  row  i  of  -Bskew(s)T p(s) ,  this 
completes  the  proof.  □ 

If  we  substitute  the  firing-rate  vector  p(s)  in  Proposition  3.3  for  an  arbitrary  firing- 
rate  vector,  then  the  skew-bias  matrix  can  be  used  to  generate  other  center-of-mass 
vectors  and  orderings. 

Definition  3.4.  Given  a  sequence  s  and  a  firing-rate  column  vector  p  =  (p i, . . .  ,pn), 
define  the  center  of  mass  of  s  induced  by  p  to  be  7 s(p)  :=  Rskew(s)rp.  Further, 
define  the  corresponding  center-of-mass  ordering  of  s  induced  by  p,  denoted  us(p), 
just  as  with  the  center-of-mass  ordering  of  a  sequence:  If  7 s(p)  =  (pi, . . . ,  gn),  then 
o-s(p)  =  (ii,  •  •  •  ,in)  where  {4, . . .  ,in}  =  [n]  are  such  that  ^(s)  <  •  •  •  <  gin( s)  and, 
in  the  case  that  gik( s)  =  pifc+1( s),  such  that  '4  <  4+i- 

Example  3.5.  Recall  that  the  sequence  s  =  12444433131124  has  skew-bias  matrix 

0  1/4  -Vs  -1/5" 

-!/4  0  0  i/s 

Vs  0  0  -3/5 

Vs  Vs  3/s  0 


The  firing-rate  vector  of  s  is  p(s)  =  -b  [4,  2,  3,  5]  and  cr(s)  =  (4,  2, 1, 3).  With  firing-rate 


vectors  p\  =  |[1, 1, 1, 1]  and  p-2  =  |[1, 1, 1,  0],  we  find  that 


17/240 

4 

y36 

4 

7s  (Pi)  = 

Vso 

Vl5 

— >  <js(pi)  = 

2 

3 

and 

7s  (P2)  = 

yi2 

-y9 

- >  o-s(p2)  = 

3 

1 

1 

-1/3. 

2 

So,  pi  and  p2  induce  the  orderings  (4, 2,  3, 1)  and  (4,  3, 1,  2),  respectively.  In  particular, 
notice  that  the  activation  of  neuron  4  changed  the  induced  ordering  of  {1,  2,  3}. 

In  the  case  of  a  pure  sequence,  this  process  can  result  in  only  one  neuronal  ordering, 
as  the  following  lemma  shows. 

Lemma  3.6.  If  s  is  a  pure  sequence,  then  &s(p)  =  a( s)  for  each  firing-rate  vector 
p  —  (pi, . . . ,  pn)  satisfying  Pi  >  0  for  each  i  G  A f . 

Proof.  Without  loss  of  generality,  we  can  assume  that  s  =  <r(s)  since  Bs kew(s)  = 
-Bskew(cr(s))  whenever  s  is  pure.  For  each  neuron  i  e  [n  —  1],  we  know  that  f3jSi( s)  = 
/Wn(s)  f°r  each  J  G  A/"\  {si,Si+i}  since  s  is  pure  and  since  ,st  and  Sj+i  are  adjacent 
spikes  (not  equal  to  j)\  additionally,  we  know  that  /3SiSi( s)  =  /3Si+lSi+1(s)  =  0  and  that 
PsiSi+1{ s)  =  1  and  PSi+lSi( s)  =  -1.  Letting  qs(p)  =  (71,..., qn),  these  facts  tell  us 
that 

n  n 

7  si =  'y  ]  Pjsi(s)pj  <  y  'j  /3jsi+1(s)pj = 7si+i  ■ 

3= 1  3=  1 

Hence,  regardless  of  the  choice  of  p,  we  always  have  that  qSl  <  •  •  •  <  jSn .  In  particular, 
letting  p  =  p( s),  we  find  that  crs(p(s))  =  cr(s),  thus  completing  the  proof.  □ 

3.2  Permutations  of  sequences 

To  understand  a  bias  matrix  A,  it  would  be  useful  to  know  which  sequences  give  rise 
to  that  bias  matrix;  that  is,  we  would  like  to  be  able  to  describe  the  set  B~^ew(A)  C  S. 
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We  can  “fill  in”  part  of  this  set  if  we  know  a  sequence  in  that  set,  that  is,  a  sequence 
s  such  that  Bs kew(s)  =  A. 

Consider  the  sequence  s  =  12321  and  its  bias-count  matrix 


C(s) 


'1  2  1' 
2  11 
1  1  0 


If  we  flip  any  two  adjacent  spikes,  the  result  in  the  bias-count  matrix  will  be  that 
exactly  two  entries  are  modified.  For  instance,  if  we  flip  the  first  two  spikes,  the 
bias-count  matrix  becomes 


C{  21321) 


'i  i  r 

3  11 
110 


Since  we  want  the  bias-count  matrix  to  remain  unchanged  if  we  are  to  find  another 
sequence  in  -B^gW(-Bskew(s)),  we  can  try  to  undo  the  effect  of  the  first  flip  by  flipping 
another  pair  of  spikes;  that  is,  since  we  made  a  subsequence  of  the  form  12  into 
one  of  the  form  21,  we  can  look  for  a  subsequence  of  the  form  21  to  change  into 
12.  Here,  the  last  two  spikes  are  21;  changing  these  and  the  first  two,  we  get  the 
sequence  s'  =  21312.  Now,  C( s')  =  C(s).  So,  indeed,  it  is  sometimes  possible  to  find 
another  sequence  with  the  same  biases  by  simply  permuting  the  spikes  of  the  original 
sequence.  It  will  be  convenient  to  have  some  notation  and  terminology  for  such  an 
operation: 

Definition  3.7.  For  a  sequence  s  =  (si,. . . ,  sf)  and  a  permutation  tt  e  Perms  ([!']), 
define  the  shuffling  of  s  by  n  to  be  s71"  :=  (s^p), . . . ,  s^p). 

Example  3.8.  Let  s  =  1212333.  The  following  table  shows  permutations  of  [len(s)]  = 
[7]  along  with  the  corresponding  shuffled  sequence. 
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Permutation 

Shuffled  sequence 

7T!  =  (1  7)  (2  6)  (3  5) 

s^1  =  3332121 

vr2  =  (2  3) 

s^2  =  1122333 

tt3  =  (1  2)  (3  4) 

=  2121333 

7T4  =  (1  3)  (5  6  7) 

s71"4  =  1212333 

Note  that  the  non-trivial  permutation  7t4  induces  a  shuffled  sequence  s71'4  that  is  equal 
to  the  original  sequence  s. 

Definition  3.9.  For  a  sequence  s  =  (si, . . . ,  sf)  and  a  collection  c o  of  neurons,  define 
the  restriction  of  s  to  u  to  be  s|w  :=  where  I w  :=  Uiea;L,:(s). 

Example  3.10.  Let  s  =  12444433131124,  and  let  =  {1,2}  and  uj2  =  {3,4}.  Then 
swi  =  121112  and  sW2  =  44443334. 

Lemma  3.11.  Let  s  =  (s4, . . . ,  sf)  be  a  sequence,  and  let  n  =  {k\  k\  +  l)(k2  k2  +  1) 
be  a  permutation  such  that  sfcl  =  Sk2+ 1  and  Sk2  =  Sfc1+1.  Then  C(s)  =  C(s7r). 

Proof.  Let  s  =  (si,...,s^)  be  a  sequence,  and  let  n  =  (. k\  k\  +  1) (fc2  ^2  +  1)  be 
a  permutation  of  [l]  such  that  s^  =  Sk2+ 1  and  Sk2  =  s^+i-  Let  i±  =  sg.,  €  A f 
and  i2  =  Sfc2  £  A/”,  and  let  ./V  =  Af  \  Observe  that  s|at  =  s71"^  since  7 r 

fixes  the  indices  of  the  spikes  from  neurons  in  N]  in  particular,  this  tells  us  that 

cjij2is)  =  O1I2 (® I Jv)  =  cjij2(s  I Jv)  =  ciij2(s  )• 

We  now  want  to  show  that  Cy( s)  =  CjJ-(s7r)  and  c,j(s)  =  Cji( s77)  if  {i,  j }fl{zi,  i2}  7^  0- 
Recalling  that  ctJ  =  LtJ  | ,  we  proceed  by  listing  the  relationships  between  index  pairs 
in  the  sets  and  Ljt .  Suppose  that  me  [£]  \  {/sq,  /c4  +  1,  fc2,  fc2  +  1}  is  a  spike  index 
and  that  j  :=  sm  €  N.  Then  the  following  relationships  hold: 

(k\,  m)  e  Lhj( s)  (/ci  +  l,  m)  e  Lhj( s77) 

(k2  +  1, m)  e  L^-(s)  (A2,m)  e  Lili(s7r) 

(fei  +  l,m)GLi2j(s)  (/ci,m)  e  L*2i(s7r) 
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(k2,m)  G  Li2j( s) 
(m,  h)  G  Ljh( s) 
(m,  k2  +  1)  G 
(m, /ci  +  1)  G  Liia(s) 
(m,  k2)  G  Liia(s) 
(fci,  +  1),  (Aa,  k2)  G  (s) 
(Ah  +  1,  k2  +  1),  (k2,  k2  +  1)  G  Li2ii  (s) 


{k2  +  1,  m)  G  Lizj^s77) 

(m,  k\  +  1)  G  -A'ji1(s7r) 

(m,  fc2)  G  L^s") 

(m,  h)  G  L^s*) 

(m,  k2  +  1)  G  Lji2(sn) 

(ki  +  1,  k2  +  1),  (Ah,  k2  +  1)  G  Lili2(sn) 
(Ah,  ki  +  1),  (Ah,  k2 )  G  -^2U  ) 


This  completes  the  proof  since  each  line  above  indicates  that  the  corresponding  index- 
pair  sets  are  of  the  same  size  for  both  s  and  sT  □ 

Since  a  sequence  of  such  permuations  can  be  applied  recursively,  one  might  wonder 
whether  all  sequences  with  equal  bias-count  matrices  can  be  obtained  from  a  single 
sequence  by  repeated  application  of  such  permutations.  This  is  not  the  case:  For 
example,  if  s  =  123231  and  s'  =  231123,  then  C*(s)  =  C( s')  but  no  such  permutation 
7T  =  (i  i  T  l)(j  j  T  1)  exists  such  that  s'  =  sn  since  there  is  no  pair  {i,j}  with  i  ^  j 
such  that  Si  =  Sj+i  and  Sj+i  =  Sj.  More  succinctly,  if  Bpro b(s)  =  fiprob(s'),  it  is  not 
necessarily  the  case  that  s'  =  s7'  for  some  permutation  ir. 

We  can  also  describe  a  broader  class  of  permutations  that  fixes  the  center-of-mass 
vectors  of  a  given  sequence.  Generally  speaking,  elements  of  this  class  will  not  fix  the 
corresponding  bias  matrices. 

Lemma  3.12.  If  s  =  (si, , . . ,  sf)  is  a  sequence  and  n  —  (Ah  Ah  +  1)  •  •  •  (km  km  +  1)  G 
Perms ([/])  with  ka+ 1  >  ka  +  1  for  all  a  G  {1, . . .  ,m  —  1},  then  center(s)  =  center(s7r) 
*/  {^fci  >  •  •  • ,  Skm  }  {^fei+i i  ■  r  ■  i  ■Sfcm+ 1 }  and  |  ,  •  •  • ,  Sfcm  }  |  m. 

Proof.  Suppose  that  i  G  {si,...,Sfc}.  Then  i  =  Skb  =  Skd+ 1  for  some  b,  d  G  [m\. 
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Noting  that  Lj(s7r)  =  [Lj( s)  \  { +  1}]  U  {kb  +  1,  A^},  we  see  that 


center*  (s)  =  center,;  (s)  + 
=  center,;  (s)  + 
=  center  (s71"). 


[kb  +  (. kd  +  1)]  —  [kb  +  (kd  +  1)] 


cAs 


[{kb  +  1)  +  kd]  —  [kb  +  {kd  +  1)] 
ci(s7r) 


If  i  then  centers)  =  center*^77)  since  L*( s71")  =  L*(s).  Hence, 

center, (s)  =  centers77)  for  all  i  G  A/”,  thus  completing  the  proof.  □ 

Example  3.13.  Let  s  =  123123  and  n  —  (1  2)  (3  4)  (5  6).  Then  s  and  n  meet  the 
stipulations  of  the  previous  lemma  since  {si,s3,s5}  =  {s2,.S4,s6}  =  {1,2,3}.  Noting 
that  s77  =  211332,  we  see  that 


center(s)  =  center(s77) 


72- 

5/2 

_9/2. 


3.3  The  fundamental  poly  tope 

The  bias  matrix  arose  in  an  attempt  to  capture  the  tendency  of  pairs  of  neurons  to 
fire  in  a  particular  order.  These  tendencies  can  be  computed  from  any  individual 
sequence,  but  the  pairwise  nature  of  the  biases  hints  at  an  underlying  network  of 
neurons  with  pairwise  preferences  for  spiking  in  particular  orders.  From  this  per¬ 
spective,  a  sequence  is  really  just  a  (noisy)  manifestation  of  the  neural  network’s 
biases.  As  such,  it  may  be  useful  to  understand  the  properties  that  characterize  all 
bias  matrices.  By  considering  the  biases  that  arise  from  sequences,  we  aim  to  derive 
a  bias-matrix  formulation  that  is  independent  of  any  individual  sequence.  In  this 
section,  we  develop  such  a  formulation  and  conjecture  that  each  collection  of  biases 
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in  this  formulation  is  realizable  as  the  collection  of  biases  of  an  actual  sequence. 

We  begin  by  consideration  of  the  bias-count  matrix  C(s)  =  [cjj(s)]jje_v.  We 
already  know  a  few  properties  that  must  be  satisfied: 

•  Its  entries  are  natural  numbers:  cV]  G  No  for  all  i,j  G  Af. 

•  Its  diagonal  entries  are  binomial  coefficients:  cu  =  ('2')  for  all  i  G  Af. 

•  For  all  neurons  i,j  G  Af  with  i  ^  j,  Cij  +  Cji  =  ctcr 

But  we  moved  from  the  bias-count  matrix  to  the  skew-bias  matrix  because  we  want 
the  individual  bias  entries  to  have  meaning  independent  of  other  entries.  When  the 
above  properties  are  translated  into  the  skew-bias  matrix,  we  find  only  two  properties: 

•  It  is  skew-symmetric:  fit]  =  —f3ji  for  all  i,j  G  Af. 

•  Its  entries  are  bounded  and  rational:  /3ij  G  [—1, 1]  D  Q  for  all  i,j  G  Af. 

Though  the  properties  listed  here  do  not  fully  characterize  the  skew-biases  in  general 
(as  we  will  see  below),  they  do  provide  a  convenient  starting  place  from  which  we  can 
further  restrict  as  we  look  for  additional  properties.  The  skew  nature  of  the  matrices 
.Bskew(s)  allows  us  to  describe  the  matrix  using  only  the  above- diagonal  entries  of  each 
matrix. 

Definition  3.14.  The  bias  space  23  (Af)  of  the  neuron  set  Af  =  [n]  is  the  (")- 
dimensional  hypercube  [—1, 1](2)  with  coordinates  indexed  by  neuron  pairs  (i,  j)  G  Af2 
with  i  <  j.  When  s  is  a  sequence,  the  vector  [Ai(s)]*<j  consisting  of  the  upper- 
triangular  portion  of  Bs kew(s)  is  the  corresponding  point  in  the  bias  space.  We  will 
abuse  terminology  and  refer  to  Bs kew(s)  as  being  in  the  bias  space  for  notational/ter- 
minological  convenience. 
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Definition  3.15.  A  point  x  G  53 (A f)  is  obtainable  if  x  is  in  the  closure  of  the  set 
{-Bskew(s)  |  s  G  5}. 

Which  points  in  53  (A f)  are  obtainable?  Let  us  first  consider  one  of  the  simplest 
classes  of  sequences:  those  with  exactly  two  active  neurons.  In  this  case — that  is, 
when  A f  =  [2] — the  only  bias  that  we  consider  is  /3i2',  further,  the  bias  space  is  just 
the  interval  [—1, 1].  Here,  we  find  that  the  biases  are  actually  dense  in  the  bias  space. 

Lemma  3.16.  For  any  x  G  [—1, 1]  and  any  e  >  0,  there  exists  a  sequence  s  G  S  such 
that  supp(s)  =  [2]  and  |/?i2 (s)  —  x\  <  e. 

Proof.  Let  x  G  [—1, 1]  and  e  >  0  be  given.  Choose  l  G  N  such  that  <  e,  and  pick 
k  G  N  as  k  =  [|(1  +  x)(£  —  1)  +  lj .  Note  that  0  <  1  +  x  <  2  since  —  1  <  x  <  1;  so, 
0  <  \(l-\-x)(l—  1)  <  £  —  1  and,  thus,  k  G  [£].  Moreover,  | k  —  |(1  —  x)£\  <  1  by  choice 
of  k.  Define  the  sequence  s  =  (si, . . . ,  sf)  by  Sk  —  2  and  sm  —  1  for  m  G  [£]  \  {k}. 
Now,  C12 (s)  —  k  —  1,  which  implies  that  612 (s)  =  =  fzy  and  that 

Ms)  =  2612(s)  -  1  =  -  1.  Now, 


|/?12(s) 


—  X  \  — 


2k -2 


—  1  —  x 


£- 1 

2k -2-  (l+x)(£-l) 


< 


-  1 
2 


^  _  (1(1  +  x )(^  _  1)  +  l)  | 


<  e, 


thus  completing  the  proof. 


□ 


But  it  is  not  generally  the  case  that  every  point  in  the  bias  space  is  obtainable. 
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To  get  some  intuition  about  why  some  points  are  not  obtainable,  consider  the  matrix 


A  = 


'  0  1-1' 
-10  1 
1  -1  0 


Suppose  that  A  =  Bskew( s)  for  some  sequence  s.  Because  all  off-diagonal  entries  are 
in  {1,  —1},  we  must  have  that  s  is  a  pure  sequence.  Considering  the  entries  more 
closely,  we  also  find  the  following: 

•  (0'i2  =  1)  All  spikes  from  neuron  1  precede  all  spikes  from  neuron  2. 

•  (®23  =  1)  All  spikes  from  neuron  2  precede  all  spikes  from  neuron  3. 

•  (®i3  =  —  1)  All  spikes  from  neuron  3  precede  all  spikes  from  neuron  1. 

The  first  two  points  (a  12  =  023  =  1)  tell  us  that  all  spikes  from  neuron  1  must 
precede  all  spikes  from  neuron  3.  But  this  is  a  contradiction  to  the  third  observation! 
Hence,  we  cannot  have  that  A  =  Bskew(s)  for  any  sequence  s.  Indeed,  we  will  see  in 
Corollary  3.19  that  A  is  not  obtainable. 

Although  we  have  not  yet  shown  that  A  is  not  obtainable,  the  point  is  well-taken: 
There  are  non-trivial  relationships  among  the  entries  f3ij,  fijk ,  and  fiik  (that  is,  among 
the  collection  of  biases  between  the  neurons  i,  j,  and  k).  In  fact,  the  above-observed 
property  holds  in  a  more  general  case:  If  s  e  S  is  pure,  then  we  must  have  that 
fiik  =  1  if  fiij  =  fijk  =  1  for  distinct  neurons  i,j,k  G  A f.  The  logic  for  seeing  that 
this  holds  is  exactly  the  same  as  in  the  motivating  example.  As  seen  in  the  proof 
of  Lemma  3.16,  only  one  spike  from  the  second  neuron  was  necessary  to  obtain  (in 
the  sense  of  density)  all  possible  biases  between  exactly  two  neurons.  As  such,  we 
begin  our  investigation  of  biases  among  three  neurons  by  considering  sequences  with 
exactly  one  spike  from  a  third  neuron. 
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Lemma  3.17.  For  any  sequence  s  £  5  with  distinct  neurons  i,j,k  G  supp(s)  arid 
with  Cfc(s)  1?  bik(l  F  bl3  F  1  bjk{\  buf). 

Proof.  Since  neuron  k  spikes  only  once,  it  is  easy  to  construct  a  sequence  that  min¬ 
imizes  min  for  fixed  (rational)  values  of  b^  and  bki ■  Since  bki  =  —  =  —  and 
bkj  =  ^fr.  =  we  see  that  =  bkiCi  and  Ckj  =  bkjCj  are  proportions  of  the  total 
number  of  times  that  neurons  i  and  j  spike,  respectively.  In  particular,  when  consid¬ 
ering  the  restricted  sequences  containing  k  and  only  one  other  neuron,  we  find  that 
there  is  only  one  possibility  for  each  restricted  sequence: 

s| {i,k}  =  iCik  ■  k  ■  iCki  and  s\{jM  =  pk  ■  k  ■  fk\ 

where  we  recall  that  the  symbol  ■  represents  concatenation.  In  joining  these  restricted 
sequences  to  form  the  full  sequence  s,  we  minimize  bij  by  placing  j  before  i  whenever 
possible,  resulting  in  the  following: 

s  =  jc*k  ■  iCik  ■  k  ■  jCkj  ■  iCki. 

It  is  now  straightforward  to  count  cl3 : 

Cij  CikCkj  ( bjkCj )  (bkjCj  )  bjk  (  1  bjk)CiCj. 

Because  s  was  chosen  to  minimize  cl3 ,  dividing  by  the  product  c,Cj  yields  one  of  the 
desired  inequalities:  b^  <  6*^(1  —  bjk).  Similar  reasoning  allows  us  to  maximize  cl3 
with  s': 

s'  =  iCik  ■  jCjk  ■  k  ■  iCki  ■  jCkj , 
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which  results  in  the  count 


Qj  CjfcCj  ~t~  CkiCkj 

bikC-iCj  ~l~  ipkiCi)(pkjCj) 

{pik  “7  ^kPkj  )  CO 
[^ifc  T  (1  ^jfc)]O0 

(1  bjk  bikbjk)C-iCj 
[1  ^ifc)]QO> 


thus  yielding  the  other  desired  inequality:  >  1  —  6^(1  —  This  completes  the 

proof.  □ 

As  it  turns  out,  however,  not  every  point  in  the  bias  space  53 ([3])  is  obtainable 
using  only  a  single  spike  from  the  third  neuron.  Consider,  for  example,  the  sequence 
s  =  3123  and  its  bias  matrix 


T^skew  (  S  ) 


'0  10" 
-10  0 
0  0  0 


Suppose  that  there  is  some  sequence  s'  such  that  Cs(s')  =  1  and  5skew(s')  =  Bs kew(s). 
Since  /^(s')  =  1,  the  restricted  sequence  s'|[2]  must  be  pure.  But  then  the  single  spike 
from  neuron  3  cannot  fall  in  the  middle  of  the  spikes  for  neuron  1  and  in  the  middle 
of  the  spikes  for  neuron  2.  Hence,  no  such  s'  can  exist.  Still,  not  all  points  in  the  bias 
space  are  obtainable  even  when  we  consider  arbitrary  sequences  on  three  neurons. 
But  this  should  not  come  as  too  much  of  a  surprise:  Our  intuition  for  sequences 
says  that  if  i  follows  j  and  k  follows  j,  then  we  would  expect  for  k  to  follow  i.  The 
inequalities  in  the  lemma  below  are  restrictions  of  such  a  form. 
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Proposition  3.18.  For  any  sequence  s  with  neurons  i,j,k  G  supp(s)  such  that  i  < 
j  <  k,  the  inequalities  0  <  bi3  +  bjk  —  hk  <  1  hold. 

Proof.  Let  s  =  (si, . . . ,  sf)  be  a  sequence.  First,  note  that  the  following  equality  holds 
because  every  triple  £i,£j,£k  G  [£]  with  .sy  =  i,  sy  =  j ,  and  s?k  =  k  (of  which  there 
are  CiCjCk)  will  produce  exactly  one  of  the  six  orderings  ijk,  jki,  kij,  ikj,  jik,  and 
kji: 

(-  ijk  T  Cikj  Cjik  T  (  jki  T  ('kij  T  C kji  CiCjCk 

We  now  show  that  the  following  relationships  exist  between  pairwise-bias  counts 
and  third-order  bias  counts: 


(-  k ('ij  (-ijk  Cikj  H~  (-  kij 

(  'i (-  }  k  C-j ki  +  Cji^  F  ('ijk 
CjCki  Ckij  -|-  Ckji  ('jki 

To  see  that  these  relationships  hold,  consider  the  first  one.  For  each  subsequence  of 
s  that  is  equal  to  (i,j) — of  which  there  are  Cy — each  of  the  occurrences  of  k  in 
s  can  produce  only  one  of  the  triples  ( i,j,k ),  ( i,k,j ),  and  ( k,i,j );  moreover,  each 
such  combination  must  produce  one  such  triple.  Hence,  the  equality  holds.  Similar 
arguments  work  for  each  of  the  other  two  equations. 

Summing  the  above  equations,  we  get  the  following  inequality: 


CkCij  T  CiCjk  T  CjCki  —  ( Cj/j k  T  (  jki:  T  Ckij )  T  ( Cikj  T  Cjik  T  Ckji) 


—  ( Cjjj k  T  (  jki  T  Ckij  T  ('ikj  T  Cjik  T  Ckji) 


(('ikj  T  Cjik  T  Ckji) 


2 CiCjCk  ( Cikj  T  Cjik  T  Ckjf) 
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Dividing  the  resultant  inequality  by  CiCjCk  yields  the  inequality  %  +  bjk  +  bki  <  2.  By 
considering  the  sum  of  the  same  equations  again,  we  find  another  inequality: 


CfcCjj  d"  Cj Cj +  CjCki  2 ( Cjj k  d"  Cjki  “I-  C-kij )  d"  (CjfcJ  d~  (jib  d"  Ckji) 


\pijk  d*  Cjki  d"  t'ki/j  d"  (ikj  d"  t  jik  d"  Ckji) 


d~  \C-ijk  d*  t'jki  ~\~  Ckij) 


CiCjCk  d"  \C-ikj  d~  Cjik  d"  Ckji) 
—  CiCjCk. 


Dividing  this  by  CjCjCfc  yields  the  inequality  bij  -f  bjk  +  bki  >  1.  Putting  these  two 
inequalities  together,  we  have  that  1  <  bij  -f  bjk  +  bki  <  2.  Recalling  that  bki  =  1  —  bik, 
this  becomes  0  <  bl3  +  bjk  —  b^  <  1,  thus  completing  the  proof.  □ 

Corollary  3.19.  For  any  sequence  s  with  neurons  i,j,  k  G  supp(s)  sach  that  i  <  j  < 
k,  the  inequalities  —1  <  /3ji  +  fdkj  —  ^  <  1  hold. 

As  sequences  become  less  pure,  our  intuition  about  one  neuron  “following”  another 
breaks  down,  and  we  run  into  tendencies  that  would  be  contradictory  in  the  case  of 
a  pure  sequence. 


Example  3.20.  Consider  the  sequence  s  = 

is 


Fskew(s) 


0 

-1/6 

.1/3 


2331112123.  The  skew-bias  matrix  of  s 

1/6  -1/3' 

0  1/9  . 

-1/9  0 


Upon  inspection,  we  see  that  (1)  neuron  1  tends  to  precede  neuron  2,  (2)  neuron  2 
tends  to  precede  neuron  3,  and  (3)  neuron  3  tends  to  precede  neuron  1. 
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Inequalities  of  the  form  in  Corollary  3.19  essentially  ensure  that  the  pairwise- 
biases  among  any  three  neurons  are  consistent  with  each  other  (i.e.,  if  j  follows  i  and 
k  follows  j,  then  k  follows  i),  a  sort  of  pseudo-transitivity  that  is  guaranteed  when 
biases  arise  from  an  actual  sequence.  Further,  the  collection  of  all  such  inequalities 
carves  out  a  polytope  in  the  bias  space,  which  we  will  call  the  consistent-bias  polytope. 
Because  three  distinct  neurons  were  required  for  this  corollary,  there  are  (^)  such 
inequalities,  each  of  which  is  really  two  distinct  inequalities.  Since  the  bias  space 
itself  is  defined  by  inequalities  of  the  form  —1  <  <  1  with  i  <  j,  this  polytope  can 

be  described  by  a  total  of  2(^)  +  2 (^)  inequalities: 

•  fiij  +  f3jk  -  /3ik  <  1  for  all  i  <  j  <  k, 

•  -/%  -  Pjk  +  f3ik  <  1  for  all  i  <  j  <  k, 

•  Pij  <  1  for  all  i  <  j, 

•  —  Pij  <  1  for  all  i  <  j. 

The  inequalities  from  Corollary  3.19  can  be  rearranged  to  bound  the  bias  of  i  to 
fire  before  k  if  the  other  biases  are  known:  (3^  +  /3jk  —  1  <  j3ik  <  +  /3jk  +  1.  This 

inequality  is  the  direct  equivalent  to  the  intuition  that  k  should  follow  i  if  k  follows 
j  and  j  follows  i.  But  longer  chains  of  this  form  do  not  create  inequalities  that  are 
more  restrictive  than  those  arising  from  such  chains  of  length  three.  For  considering 
possible  biases  among  more  than  three  neurons,  we  start  by  looking  at  the  simplest 
of  such  biases:  those  arising  from  pure  sequences. 

Definition  3.21.  A  pure-bias  matrix  is  a  matrix  A  such  that  A  =  -Bskew(s)  f°r  some 
pure  sequence  s  G  S. 

For  each  pure-bias  matrix  A,  there  is  a  sequence  s  with  one  spike  per  neuron  such 
that  Bs kew(s)  =  A.  Let  us  start  by  considering  the  sequence  s  =  (1,2,... , n).  Here, 
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we  see  that  i  precedes  j  whenever  i  <  j;  as  such,  the  corresponding  entries  of  Bs kew(s) 
are  /3p(s)  =  1  and  f3ji( s)  =  —1.  For  example, 


£?skew(1234) 


0  111 
-10  11 
-1-10  1 
-1  -1  -1  0 


Indeed,  up  to  row  and  column  permutations,  the  bias  matrix  of  each  pure  sequence 
is  of  this  form,  as  the  following  lemma  shows. 

Lemma  3.22.  Suppose  that  s  e  S  is  pure.  Then  Bskew(s)  =  P~XAP  for  A  = 
Bskew(  12  •  •  •  n)  and  some  permutation  matrix  P. 

Proof.  Let  s  e  S  be  pure.  Without  loss  of  generality,  we  can  assume  that  s  = 
(si,...,sn)  has  one  spike  per  neuron  since,  for  a  pure  sequence,  it  is  clear  that 
-Bskew(s)  =  Pskewi^is)).  Note  that  /3Si,Sj( s)  =  (3^(12  ■■ -ri)  since  st  precedes  Sj  in 
s  if  and  only  if  i  precedes  j  in  12-  •  •  n.  Dehne  7 r  G  Perms  (A/")  by  7r  (sj)  =  i,  and  let 
P  =  \Pij\ijeAf  be  the  permutation  matrix  with  entries 


Pij 


{1,  7 r(j)  =  i; 

0,  otherwise. 


Letting  C  =  P  1AP  =  [cij]ij£_\r  and  D  =  AP  =  [dij]ijej^r,  we  find  that 


^  ^  Pk,Sidk,s 


k= 1 
n 


^  yPk,Sj  I  ^ 
k= 1  \£=1 

n 

Pk,Si^k,n(sj) 

k=  1 
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an(si),n(sj) 

=  ttij 

=  Ai(12  ■■■n) 

—  flsi,Sj  (s)- 

Therefore,  Bskew(s)  =  P~lAP  as  claimed,  completing  the  proof.  □ 

Pure  sequences  are  extreme  from  the  perspective  of  biases.  Indeed,  each  bias 
corresponding  to  a  pure  sequence  is  a  vertex  of  the  the  bias  space  hypercube.  As  we 
have  already  seen  in  this  section,  not  all  vertices  of  the  bias  space  are  legitimate  biases. 
We  now  know  exactly  which  vertices  correspond  to  legitimate  biases.  Moreover,  we 
have  a  simple  count  for  how  many  vertices  are  obtainable  biases:  There  is  one  for 
each  permutation  of  the  neuron  set  J\f  =  [n] ;  that  is,  n!  of  the  2  W  bias-space  vertices 
are  obtainable  biases. 

But  we  can  glean  more  from  this  investigation  of  pure  biases.  Consider  two  pure 
sequences  with  one  spike  per  neuron  that  are  as  close  to  each  other  as  possible,  that 
is,  sequences  that  differ  by  a  single  transposition  of  spikes.  For  the  sake  of  example, 
we  will  consider  s  =  1234  •  •  •  n  and  s'  =  s*d  2)  =  2134  ■  ■  ■  n.  From  the  perspective  of 
the  neurons  in  A f  \  [2] ,  neurons  1  and  2  both  behave  exactly  the  same  (since  both 
1  and  2  precede  all  other  spikes  in  both  s  and  s').  Many  other  sequences  share  this 
property  with  s  and  s';  in  fact,  we  can  completely  characterize  the  collection  of  all  such 
sequences  as  {(si, . . . , sg,  3, 4, . . . , n)  \  (s\, ...  ,Si)  €  <S([2])}.  That  is,  we  can  prepend 
an  arbitrary  sequence  from  the  neuron  set  [2]  to  the  beginning  of  the  sequence  34  •  •  •  n. 
But  we  know  something  about  the  biases  arising  from  the  sequences  in  <S([2]):  Every 
bias  is  obtainable!  This  tells  us  that  the  entire  line  joining  s  to  s '  in  the  bias  space  is 
obtainable  as  a  bias.  This  is  similarly  true  for  any  such  pair  of  “adjacent”  sequences 
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Figure  3.1:  When  A f  =  [3],  the  fundamental  polytope  of  S  is  3-dimensional.  Above, 
this  polytope  is  shown  below  from  two  angles.  In  this  case  of  only  three  neurons, 
the  fundamental  polytope  is  equal  to  the  consistent-bias  polytope.  Note  that  n  =  3 
is  a  special  case:  These  polytopes  are  not  equal  in  general.  Since  they  are 
equal  here,  we  can  easily  write  down  the  hyperplanes  that  serve  as  its  boundaries: 
— 1  A  Al2  +  /?23  —  /?i3  A  1,  —1  <  Pi2  A  1,  —1  <  fd 13  A  1,  and  —1  <  $23  <  1 


with  one  spike  per  neuron.  More  surprisingly,  each  obtainable  bias  is  in  the  convex 
hull  of  the  collection  of  pure  biases.  Before  showing  this  claim  to  be  true,  we  name 
this  polytope  and  show  its  relationship  to  the  consistent-bias  polytope. 

Definition  3.23.  The  fundamental  polytope  of  S  is  the  convex  hull  of  the  set  of  pure 
biases  arising  from  sequences  in  S. 


Lemma  3.24.  The  consistent-bias  polytope  contains  the  fundamental  polytope. 

Proof.  Since  the  vertices  of  the  fundamental  polytope  are  obtainable  biases,  they  must 
satisfy  the  constraints  that  specify  the  consistent-bias  polytope;  moreoever,  since  both 
polytopes  are  convex,  we  must  have  that  the  fundamental  polytope  is  contained  in 
the  consistent-bias  polytope.  'Jtf 

Theorem  3.25.  If  s  e  S  with  supp(s)  =  J\f ,  then  the  bias  corresponding  to  s  is 
contained  in  the  fundamental  polytope. 
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Proof.  Let  t  =  12  ■  ■  -  n,  and  let  Pn  =  {Bs\iew(tn)  \  n  G  Sn}  be  the  collection  of  pure- 
bias  matrices.  Also,  let  s  =  (si, . . . ,  St)  be  any  sequence  with  supp(s)  =  A f.  We  need 
only  to  show  that  -Bskew(s)  is  a  convex  combination  of  the  elements  of  Pn.  For  each  sub¬ 
sequence  s'  of  s  that  is  equal  to  tn  for  some  7 r  G  Perms ([n]),  we  have  that  Bs kew(s')  G 
Pn  by  definition  of  Pn.  Let  Sub(s)  —  {IQ  [•£]  |  s  7  =  t77  for  some  n  G  PermsQn])},  and 
define  the  sum  D  =  [dtJ]  to  be 

I 

D  =  T7W  -^skew  (S/)- 

il?;=i  CASJ  /eSub(s) 

Because  Bs kew(s/)  G  Pn,  the  proof  will  be  finished  if  D  =  L?skew(s)  and  if  the  coeffi¬ 
cients  of  its  summands  sum  to  1. 

To  see  that  the  latter  of  these  requirements  holds,  first  note  that  all  coefficients 
are  equal.  Moreover,  there  are  exactly  nr=ic*(s)  summands  in  the  definition  of  D : 
Choosing  a  subsequence  of  s  that  is  and  ordering  of  A f  is  the  same  as  choosing  one 
index  from  each  of  the  sets  L.f  s)  for  i  G  A f,  which  can  obviously  be  done  in  n"=i  c(s) 
ways  since  Cj(s)  =  |L*(s)|  for  each  i  G  A f. 

To  see  that  D  =  £>skew(s),  consider  an  arbitrary  entry  dl} : 


dij 


I  _ 


nLu*(s) 

1 

nLi^(s) 


/£Sub(s) 

(  £  1+  £  - 

JeSub(s)  /gSub(s) 

C;j(s/)  =  1  Cji  (s/ )  —  1 


Cjj  (  S  )  Cjj  ( s 


Cjj[S)  Cji[  s) 
Ci{a)cj{s) 


=  Pij(  s) 
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Hence,  we  see  that  the  fundamental  polytope  contains  all  biases.  □ 

Though  all  biases  are  contained  in  the  fundamental  polytope,  the  question  of 
whether  or  not  the  biases  are  dense  in  the  fundamental  polytope  is  still  an  open 
question. 

Question  3.26.  Is  every  point  in  the  fundamental  polytope  obtainable? 

3.4  The  bias  network  of  a  sequence 

Neuronal  templates  are  appealing  because  they  provide  a  simple  combinatorial  picture 
of  neuronal  sequences,  but  this  representation  is  lacking  in  many  aspects.  The  bias 
matrix  can  be  used  to  define  a  more-robust  combinatorial  representation  of  these 
sequences. 

Definition  3.27.  Let  A  =  [ciij\ij£M  be  a  bias  matrix.  We  define  the  bias  network 
of  A  to  be  the  simple  digraph  G(A)  with  vertext  set  Af  and  edge  set  E(A)  :  = 
{( i,j )  |  aij  >  0}.  The  bias  network  of  a  sequence  s  is  G( s)  :=  G(-Bskew(s))  with  edge 
set  E( s)  =  E(Bskew( s)). 

Definition  3.28.  A  tournament  is  a  complete  oriented  graph. 

Definition  3.29.  A  digraph  D  with  edge-set  E  is  transitive  if  (wi ,  v3)  e  E  whenever 
(vi,v2),  (v2,v3)  E  E. 

The  vertices  of  a  transitive  tournament  can  be  totally  ordered  by  reachability 
(i.e. ,  vertex  v  precedes  vertex  u  in  the  ordering  if  (v,u)  is  an  edge).  The  following 
lemma  shows  one  way  in  which  bias  networks  generalize  neuronal  templates:  by 
unambiguously  inducing  a  total  ordering  in  the  case  of  a  pure  sequence. 
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Sequence  s 


Bias  network - >  Neuronal  ordering 


Recurrent  network  model 


Network  synfire  chain 


Figure  3.2:  The  bias  network  model.  This  diagram  represents  how  the  bias  net¬ 
work  serves  as  a  generalization  of  neuronal  templates.  Starting  with  a  sequence  s, 
the  center-of-mass  vector  and  ordering  can  be  computed  directly  (right  side).  Alter¬ 
nately,  the  same  vectors  can  be  computed  via  the  bias  matrix;  this  also  allows  for  the 
generation  of  other  orderings  by  varying  the  firing-rate  vector  p  used  to  generate  the 
center-of-mass  vector.  The  dashed  line  across  the  middle  indicates  that,  in  the  case 
of  pure  sequences,  the  center-of-mass  ordering  is  recoverable  from  the  bias  network, 
as  shown  in  Lemma  3.30.  At  the  bottom  level,  these  two  combinatorial  objects  can 
be  used  to  generate  sequences  using  recurrent  networks  and  network  synfire  chains, 
topics  beyond  the  scope  of  this  paper. 


Lemma  3.30.  If  A  is  a  pure-bias  matrix,  then  G(A )  is  a  transitive  tournament. 

Proof.  Let  J\f  =  [n] .  If  n  =  1,  then  the  claim  is  trivially  true.  Let  s  be  a  sequence 
with  Rskew(s)  =  A,  and  suppose  that  n  >  1.  Since  s  is  pure,  some  neuron  i  e  M  must 
spike  first.  Then  s |a/"\{*}  is  a  pure  sequence  on  n  —  1  neurons.  By  induction  on  n,  we 
have  that  G(5skew(s|A/'\{*}))  is  a  transitive  tournament.  Because  i  precedes  all  other 
neurons,  we  have  that  =  1  for  each  j  e  J\f\  {i}.  In  particular,  this  means  that 

the  graph  G(Rskew(s))  =  A  is  transitive,  completing  the  proof.  □ 

As  a  result  of  Lemma  3.30,  it  is  clear  that  all  possible  transitive  tournaments  can 
be  realized  as  the  bias  network  of  some  sequence.  But  it  is  not  obvious  which  other 
digraphs  are  realizable,  and  it  would  be  surprising  if  a  sequence  could  be  constructed 
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to  match  an  arbitrary  list  of  pairwise  orderings  (as  a  simple  digraph  specifies). 
Question  3.31.  Which  digraphs  are  realizable  as  the  bias  network  of  some  sequence? 

The  following  examples  explore  this  question. 

Example  3.32.  The  sequence  s  =  12444433131124  has  skew-bias  matrix 

0  1/4  -Vs  -Vs" 

— 1/4  0  0  -1/5 

V  3  0  0  -3/5 

Vs  Vs  3/5  0 

This  allows  us  to  easily  see  that  G( s)  has  adjacency  matrix 

"OlOO" 

0  0  0  0 
1  0  0  0' 

1110 

Graphically,  we  can  visualize  G( s)  as  below. 

Q-H2> 


©—5 


Notice  that  G(s)  is  not  a  tournament  because  it  does  not  contain  an  edge  between 
vertices  2  and  3,  which  happened  because  /Vs(s)  =  /?32(s)  =  0. 


Example  3.33.  The  sequence  s  =  2331112123  has  bias  network  G( s)  with  adjacency 
matrix 

'0  1  O' 

0  0  1. 

1  0  0 


Graphically,  G(s)  is  shown  below. 
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So  G( s)  is  clearly  not  transitive  since  (1,2),  (2,3)  G  E( s)  but  (1,3)  ^  E( s). 

The  answer  to  Question  3.31  is  surprising:  We  have  control  over  each  individual 
tendency  in  a  bias  network. 

Theorem  3.34.  Every  bias  network  is  possible. 

To  prove  this  theorem,  we  first  need  some  other  results  and  definitions. 

Definition  3.35.  The  transpose  of  a  sequence  s  =  (si, . . . ,  sg)  is  the  sequence  s7  = 

(sg,  st-i, . . . ,  si). 

Note  that  the  transpose  of  the  transpose  is  the  original  sequence;  that  is  (s7  )T  =  s 
for  any  sequence  s  G  S. 

Lemma  3.36.  For  any  sequence  s,  Bskew( sT)  =  Bske «,(s)T. 

Proof.  Consider  neurons  i,j  G  A f .  For  each  index  pair  (k,  k')  G  Ly( s),  we  must  have 
that  (k',k)  G  Lji( sT).  Similarly,  for  each  index  pair  (■ m,m ')  G  Ly( sT),  we  must  have 
that  (■ m',m )  G  Lji( s).  Hence,  Cy( s)  =  |Ljj(s)|  =  |Ljj(sT)|  =  Cji( sT)  for  arbitrary 
neuron  pairs.  Now, 


Cji{ ST)  ~  Cjj( sT) 
Cji{  ST)  +  Cij(  ST) 


Cjj(s)  Cji(  s)  of 
Cij(s)  +  Cji(s)  -  1  ij[ 


-Pji( s)- 


In  particular,  Bs kew(s7)  =  — -Bskew(s)  =  -Bskew(s)T  since  Bs kew  is  skew-symmetric,  thus 
finishing  the  proof. 


□ 
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Lemma  3.37.  Given  a  sequence  s,  a  sign  value  a  G  {  —  1, 1},  and  distinct  neurons 
i,j  G  A f,  there  is  a  sequence  s'  such  that 

1.  sign (fiij (s'))  =  a  and 

2.  sign(/39r(s'))  =  sign(/3gr(s))  for  {q,r}  ±  {i,j}. 

Proof.  Recall  that,  for  any  distinct  neurons  i.j  G  Af,  Bn  =  ClJ  .  Since  c„-  +  cn 
is  a  count  of  neuron  pairs,  it  is  positive;  in  particular,  this  tells  us  that  sign(/9u)  = 
sign(Qj  —  Cji).  Now,  let  s  G  S  be  given  and  fix  i,j  G  Af  as  distinct  neurons. 

Since  Rskew  is  skew-symmetric,  assume  without  loss  of  generality  that  a  —  1  (by 
exchanging  i  and  j  if  a  =  —1).  Pick  k  G  N  such  that  2 k2  >  Cji( s)  —  Qj(s),  and 
define  s'  :=  ik  ■  jk  ■  s  •  ik  ■  jk.  We  will  first  show  that  Aj(s')  >  0,  implying  that 
sign(/3jj(s'))  =  1  =  a.  Note  that  Cy( s')  =  3 k2  +  k  ■  Cj(s)  +  q(s)  •  k  +  Cy(s)  and  that 
Cjj (s')  =  k2  +  k-Ci(s)  +Cj(s)  •  k  +  Cji(s).  Now,  Cjj(s)  —  Cjj(s)  =  2k2  +  Cy(s)  —  Cj*(s)  >  0, 
as  claimed. 

Let  q,r  G  Af  be  distinct  with  {q,r}  ^  Suppose  that  {q,r}  D  {i,j}  =  0. 

Then  (3qr(s')  =  /dqr(s'\x\{i,j})  =  Pqr(s\M\{i,j})  =  Pqr( s).  In  particular,  sign(/3(?r(s'))  = 
sign(/3gr(s)).  Now,  suppose  that  q  G  {i,j}  but  r  {i,j}-  Then  c9r(s')  =  k  ■  cr(s)  + 
cqr( s)  and  crq( s')  =  crq( s)  +  cr(s)  •  k]  hence,  cqr( s')  —  crq( s')  =  cqr( s)  —  crq( s).  In 
particular,  sign(/39r(s'))  =  sign(cqr(s')  -  crq( s'))  =  sign(c(?r(s)  -  crq{ s))  =  sign(/3?r(s)). 
Therefore,  the  lemma  holds.  □ 

Definition  3.38.  A  sequence  s  =  (si, . . .  ,S()  is  a  palindrome  if  Sk  =  S£_k+ 1  for  all 
k  G  [£]. 

Lemma  3.39.  If  s  G  S ( [n] )  is  a  palindrome,  then  Bskew( s)  =  0n,  where  0„  is  the 


n  x  n  zero  matrix. 
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Proof.  Let  s  =  (si, . . . ,  sf).  If  t  is  even,  then  s  =  uuT  for  some  sequence  u.  For  any 
i,j  G  AT,  Cij( s)  =  Cij{ u)  +  Ci(u)cj(u)  +  Cji( u)  and  c^(s)  =  c^(u)  +  Cj(u)cj(u)  +  c^-(u); 
in  particular,  Ai(s)  =  =  °- 

If  l  is  odd,  then  s  =  u/cuT  for  some  sequence  u  and  some  neuron  k  G  A f.  For 
any  i,j  k,  the  center  spike  of  k  does  not  change  the  bias  /^-(s);  hence,  we  are 
reduced  to  the  previous  case.  Also,  Qfc(s)  =  Qfc(u)  +  C;(u)(l  +  Cfc(u))  +  cki(u)  and 
Cfci(s)  =  Cfci(u)  +  (1  +  cfc(u))cj(u)  +  Ci fc(u);  again,  we  see  that  /3ik( s)  =  =  0. 

Hence,  the  claim  holds.  □ 

We  are  now  ready  to  show  that  every  simple  digraph  is  realizable  as  a  bias  network. 

Proof  of  Theorem  3.3 f.  Let  D  be  an  arbitrary  digraph.  Then  D  has  skew-symmetric 
adjacency  matrix  A  G  {—1,0,  l}nxn.  We  want  to  show  that  there  exists  a  sequence  s 
on  neuron  set  A f  =  [n]  such  that  Bs kew(s)  =  A. 

Let  sq  be  a  palindrome  with  supp(s0)  =  A f.  By  Lemma  3.39,  we  know  that 
Hskew(so)  =  0 n  is  the  n  x  n  zero  matrix.  Define  /  :=  {(i,j)  G  A fs  \  al3  >  0}  to  be 
the  set  of  index  pairs  at  which  A  is  positive,  and  let  /  =  {(H,  ji),  •  •  • ,  ( im,jm )}  be  an 
enumeration  of  I.  We  proceed  by  induction  on  m.  If  m  =  1,  then  A  has  only  one 
positive  entry  alxjx .  Since  s0  is  the  zero  matrix,  Lemma  3.37  provides  a  sequence  Si 
such  that  Am(si)  =  1  and  Pqr( si)  —  0  for  all  q,r  G  A f  with  {q,r}  ^  that  is 

^skew(Sl)  A. 

Suppose  now  that  m  >  1,  and  assume  for  induction  that  there  is  a  sequence 
Sm-i  such  that  sm_i)  =  =  0  and  ^r(sm_i)  =  aqr  for  all  other 

q,  r  G  A f.  Lemma  3.37  now  provides  a  sequence  sm  such  that  ffmJm  (sm)  =  1  and 
/3qr(sm)  =  (3qr( sm_i)  for  all  other  q,  r  G  A f.  Then  Bs kew(sm)  =  A.  Hence,  by  induction, 
the  lemma  holds  for  any  m  >  0.  □ 
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Chapter  4 

Comparing  sequences 

4.1  Correlation  between  sequences 

In  the  brain,  it  is  often  the  case  that  signals  are  very  noisy  (due  to  neuronal  mis¬ 
fires,  improper  spike  sorting,  etc.);  as  a  result,  comparing  sequences  directly  to  each 
other  may  not  be  so  effective.  Additionally,  complications  arise  when  one  wants,  for 
instance,  to  compare  relatively  long  neuronal  sequences  (e.g.,  place-held  sequences) 
to  relatively  short  neuronal  sequences  (e.g.,  SWR  sequences):  How  does  one  mean¬ 
ingfully  (and  generically)  compare  a  sequence  with  10  spikes  to  a  sequence  with  100 
spikes?  Even  if  a  meaningful  comparison  were  developed  to  directly  compare  the  se¬ 
quences,  this  comparison  would  likely  be  computationally  expensive  since  these  tech¬ 
niques  usually  involve  modifying  one  sequence  until  it  becomes  the  other  sequence. 
As  we  attempt  to  compare  sequences  in  a  less  direct  way,  we  must  ask  a  natural 
question:  What  does  it  mean  for  two  sequences  to  be  similar  (or  even  “the  same”)? 
As  we  will  hopefully  show,  the  bias  matrix  provides  a  somewhat  natural  framework 
for  answering  this  question  and,  thus,  for  comparing  sequences. 

Recall  that  the  bias  matrix  arose  out  of  an  attempt  to  generalize  our  intuitive 


Figure  4.1:  Spike  raster  plot  of  an  impure  sequence  that  induces  an  unambiguous 
neuronal  ordering 


notion  of  a  pure  neuronal  sequence.  Even  with  this  representation  of  sequences  as 
biases  (which  are  vectors),  careful  consideration  should  be  given  to  what  method 
is  employed  to  compare  these  vectors.  It  is  tempting  to  want  to  compare  biases 
using  the  standard  Euclidean  distance,  but  this  may  not  be  the  most  meaningful  of 
comparisons.  To  see  this,  once  again  consider  pure  sequences.  Recalling  Lemma  3.6, 
the  bias  matrix  of  a  pure  sequence  contains  sufficient  information  to  determine  the 
original  neuronal  ordering.  Pure  sequences,  though,  are  not  the  only  sequences  for 
which  the  bias  matrix  allows  us  to  unambiguously  reconstruct  an  ordering  in  the  same 
manner.  Indeed,  some  of  the  sequences  corresponding  to  these  other  bias  matrices 
even  match  up  well  with  our  original  intuition. 

Example  4.1.  Consider  the  spike-raster  plot  in  Figure  4.1.  Though  the  spikes  of  the 
neurons  are  interlaced,  it  seems  obvious  to  the  casual  observer  that  there  is  only  one 
neuronal  ordering  that  makes  sense:  neuron  1  fires,  then  neuron  2,  and  finally  neuron 
3.  This  casual  observation  seems  slightly  less  obvious  when  considering  the  sequence 
representation  s  =  123123,  but  it  emerges  again  upon  computing  the  bias  matrix: 


-Rskew  (  S  ) 


'  0  1/2  l/2' 

-1/2  0  1/2 

.-1/2  —1/2  0. 


1 

2 


5skew(123). 


The  skew-bias  matrix  of  s  is  a  scaling  of  that  of  the  pure  sequence  123.  As  such, 
-Bskew(s)  induces  the  same  ordering  as  -Bskew(123),  namely  the  ordering  (1,2,3). 

From  this  perspective,  it  seems  that  the  underlying  “ordering,”  if  you  will,  of  a 
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bias  matrix  is  characterized  simply  by  the  direction  in  which  the  bias  matrix  points 
(when  considered  as  a  vector).  This  further  suggests  that  the  magnitude  of  a  bias 
vector  is  not  so  much  an  indication  of  an  underlying  order  as  it  is  an  indication  of  the 
purity  of  a  sequence.  It  is  with  this  perspective  that  we  define  a  correlation  between 
sequences. 

Definition  4.2.  We  define  the  product  of  sequences  s  and  s'  to  be  (s,  s')  := 
Si, jew  Pij(s)Pij(s>)  for  w  =  supp(s)  n  supp(s').  Note  that  (s,  s')  =  (s|w,s'|w)  = 
(-E>skew(s|w),  -£>skew(s/|w))>  where  the  last  expression  is  the  standard  Euclidean  inner 
product  of  -Bskew(s|w)  and  Bs kew(s/|w)-  Also,  we  define  the  magnitude  of  a  sequence 
s  to  be  ||s||  :=  yj (s, s).  With  these  expressions  in  hand,  we  can  finally  define  the 
correlation  of  sequences  s  and  s'  to  be 


corr(s,  s') 


supp(s) 


In  terms  of  geometry,  corr(s,s')  is  the  cosine  of  the  angle  between  -Bskew(s|w)  and 
-Bskew^lcj),  where  again  uj  =  supp(s)  Dsupp(s').  In  these  definitions,  we  restrict  to 
the  common  set  of  neurons  in  part  because  noise  is  a  major  consideration  in  real 
data.  A  real  sequence  might  contain  a  spike  from  a  neuron  that  should  not  have  been 
active  but  not  contain  a  spike  from  a  neuron  that  should  have  been  active.  Though 
we  cannot  know  for  sure  which  spikes  “should”  and  “should  not”  be  in  a  sequence, 
we  attempt  to  mitigate  this  problem  slightly  by  comparing  sequences  based  only  on 
the  collection  of  neurons  that  are  active  in  both  sequences. 
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Example  4.3.  Let  s  =  12444433131124  and  s'  =  2431.  Then 
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Noting  that  supp(s)  =  supp(s')  =  [4],  the  correlation  between  s  and  s'  is  corr(s,  s')  = 
H  foy  Try .  Now,  each  component  of  this  can  be  computed  as  follows: 


<s,s')  =  Aj(s)Ai(s') 

4-1  4 

=  Ai(s)Aj(s') 

i= 1  j=i-\-l 

=  2(4  +  l  +  I  +  o-I  +  !) 

_  41 
“60 

llsll  =  \/2(^  +  §  +  i5+0~^  +  i)  = 

||s'||  =  a/12  =  2y/3 


Putting  this  all  together,  we  find  that  corr(s,  s')  =  ~  0.6168. 


4.2  Significance  of  correlation 

While  we  do  now  have  the  ability  to  compare  arbitrary  sequences  using  our  correla¬ 
tion  measure,  there  is  still  an  issue  to  be  addressed:  How  do  we  interpret  a  correlation 
value?  Despite  coming  to  the  bias-matrix  representation  of  a  sequence  and  the  corre¬ 
lation  measure  by  fairly  intuitive  means,  a  correlation  value  does  not  seem  to  have  an 
intuitive  interpretation  beyond  (1)  its  sign  indicating  correlation  or  anti-correlation 
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Figure  4.2:  Both  of  these  pairs  of  sequences  seem  to  be  similar  (to  the  eye),  so  we 
might  expect  them  to  have  relatively  high  correlation  values.  However,  neither  pair 
does:  The  arm-run  sequences  have  a  correlation  value  of  0.52572,  and  the  pure  se¬ 
quences  have  a  correlation  value  of  0.2.  These  low  values  are  somewhat  unsatisfactory 
since  maximally  uncorrelated  sequences  have  a  correlation  value  of  zero. 

and  (2)  larger  magnitude  indicating  strength  of  (anti-)correlation. 

In  Figure  4.2,  we  see  an  example  of  how  a  correlation  value  alone  does  not  nec¬ 
essarily  say  so  much  about  the  similarity  of  two  sequences.  Why  does  something 
like  this  happen?  This  is  partly  an  issue  of  angles  and  dimensionality.  Consider  the 
pure  sequences  in  Figure  4.2b.  By  moving  just  a  single  spike,  we  modified  a  large 
proportion  of  the  entries  in  the  pairwise-bias  matrix.  If  these  short  sequences  had  a 
common  subsequences  prepended  to  them,  then  the  correlation  value  would  be  much 
higher  even  though  the  same  number  of  spikes  are  “out  of  order”;  this  is  because  a 
much  larger  percentage  of  the  pairwise-bias  matrix  entries  are  unmodified  in  this  case. 
When  considering  the  arm-run  sequences  in  Figure  4.2a,  we  notice  that  each  of  the 
neurons  has  spikes  that  are  interlaced  with  spikes  from  many  of  the  other  neurons. 
When  spikes  are  interlaced  in  a  consistent  way  like  in  Example  4.1,  bias  matrices  end 
up  being  scalings  of  pure  biases.  These  spikes,  however,  are  not  interlaced  in  such 
consistent  ways;  and  the  result  is  that  the  bias  matrices  end  up  pointing  in  fairly 
different  directions.  Still,  to  the  eye,  these  long  arm-run  sequences  could  be  much 
more  different  from  each  other  if  we  just  shifted  some  of  their  spikes  around. 

And  here  we  stumble  onto  some  key  intuition:  We  evaluate  the  similarity  of  one 
sequence  s  to  another  s  not  just  on  how  close  s  is  to  s  but  also  on  how  much  more 
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different  they  could  be  from  each  other.  In  a  sense,  we  ask  the  question  of  where  their 
similarity  falls  in  relation  to  the  similarities  of  other  pairs  of  sequences. 

Definition  4.4.  Given  sequences  s  and  s,  the  significance  of  correlation  of  s  to  s  is 
the  proportion  of  shufflings  of  s  that  are  more  strongly  correlated  to  s  than  s  is  to 
s.  More  precisely,  letting  II  =  {n  e  Perms ([t7])  :  |corr(s,  s)|  <  |corr(s,  s71")  | },  then  the 
significance  of  the  correlation  of  s  to  s  is  sigs(s)  :=  . 

Example  4.5.  We  used  computational  techniques  (as  will  be  described  in  Section  4.4) 
to  approximate  the  significance  values  corresponding  to  the  correlations  of  Figure  4.2. 
In  our  simulations,  the  significance  of  the  correlation  values  of  the  arm-run  sequences 
and  of  the  pure  sequences  were  approximately  0.00  and  0.40,  respectively.  Using  a 
standard  p- value  of  0.05,  This  indicates  that  the  correlation  value  between  the  arm- 
run  sequences  is  significant  but  that  the  correlation  value  between  the  pure  sequences 
is  not  significant. 

4.3  Global  significance 

The  previous  section  allows  us  to  compare  collections  of  sequences  while  putting  each 
correlation  on  more  even  footing;  i.e. ,  we  can  set  a  threshold  for  significance  instead  of 
for  (absolute  value  of)  correlation.  Now,  we  are  brought  to  the  question  of  similarity 
of  one  collection  of  sequences  to  another.  Let  U  =  (ui, . . . ,  ua)  and  V  =  (vi, . . . ,  v&) 
be  lists  of  sequences  in  S.  We  want  to  ask  how  similar  is  U  to  V.  One  natural  place 
to  start  answering  this  question  is  to  consider  all  of  the  correlation  values  between  a 
sequence  in  U  and  a  sequence  in  V. 

Definition  4.6.  We  define  the  correlation  matrix  between  U  and  V  to  be  Corr(I7,  V)  := 
[corr(u,  v)]ue[/,  vev- 
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Though  we  can  look  for  commonalities  and  statistics  among  the  entries  of  this 
matrix  (e.g.,  that  it  has  all  positive  entries  or  by  considering  the  distribution  of 
values),  we  face  a  similar  problem  as  when  looking  at  individual  correlation  values: 
Aside  from  basic  properties  (such  as  having  only  positive  entries),  we  do  not  have  a 
good  interpretation  of  the  magnitude  of  correlation  (as  demonstrated  in  the  examples 
of  the  previous  section).  As  such,  we  will  approach  the  answer  to  this  qeustion  in 
the  same  way  as  for  individual  correlations:  by  considering  significance  values.  But 
we  will  need  to  iterate  the  process,  in  a  sense,  because  we  are  comparing  collections 
instead  of  just  individual  sequences. 

Definition  4.7.  Define  the  significance  matrix  of  correlation  of  U  to  V  to  be 
Sigy(T7)  :=  [sigv(u)]ueC/)  vey. 

The  significance  matrix  allows  us  to  identify  how  significantly  correlated  each 
sequence  in  U  is  to  each  sequence  in  V.  Still,  though,  the  question  remains: 
How  do  we  convert  this  information  to  some  sort  of  meaningful  value  to  tell  us 
how  similar  U  is  to  V?  Given  a  significance  parameter  p  G  (0, 1),  we  can  count 
how  many  pairs  (u,  v)  G  U  x  V  satisfy  sigv(u)  <  p.  Though  this  gives  us,  for 
instance,  the  percentage  of  pairs  that  are  significantly  correlated  (as  determined 
by  p).  we  have  still  not  quite  reached  a  satisfactory  measure.  What  percent¬ 
age  of  pairs  is  a  significant  percentage?  It  really  depends  on  the  collections  U 
and  V.  As  we  compared  corr(s,  s)  to  a  distribution  in  the  previous  section,  we 
must  find  an  appropriate  distribution  of  values  to  which  we  can  compare  the  count 
sig v(U,p)  :=  | { (u,  v)  G  U  x  V  |  sigv(u)  <  p}\.  Naturally,  as  before,  we  want  to  com¬ 
pare  this  value  to  the  counts  gotten  by  comparing  a  random  shuffling  of  the  sequenes 
in  U  to  V .  If  7?  =  (7Ti, . . .  ,7 r0)  G  n(C/)  :=  Perms([lcn(u1)])  x  •  •  •  x  Perms([len(ua)]), 
define  Un  :=  (u^1, . . . ,  u£“).  Now,  the  global  significance  of  correlation  of  U  to  V  is 
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gv(U,p)  =  where  A  =  {7?  e  II (U)  |  sigy(I/7r, p)  >  sig v(U,p)}.  Hence,  we  have 
a  measure  telling  us  how  significantly  correlated  the  list  U  is  to  the  list  V. 

4.4  Monte-Carlo  computation  of  significance 

Mathematically,  the  significance  of  correlation  and  global  significance  of  correlation 
values  described  in  the  previous  two  sections  allow  us  to  have  confidence  that  two 
sequences  or  two  collections  of  sequences  share  (or  do  not  share)  similarities.  In 
practice,  however,  the  distributions  necessary  for  computation  of  these  values  are  often 
prohibitively  large.  For  instance,  if  s  is  a  sequence  with  only  ten  spikes,  calculation  of 
sigs(s)  requires  computation  of  over  3.6  million  correlation  values.  And  the  situation 
gets  a  lot  worse  as  the  length  of  s  grows,  not  to  mention  the  added  complexities  when 
trying  to  compute  a  global  significance  value. 

Because  of  this  difficulty,  we  employ  a  Monte-Carlo  method  to  approximate  the 
two  types  of  significance  values.  In  the  analysis  that  follows  in  Section  5.3,  each 
sequence  is  a  part  of  some  collection  of  sequences.  We  leverage  this  fact  by  simulta¬ 
neously  approximating  the  distribution  for  both  types  of  significance  values.  Let  T 
be  some  (large)  natural  number  representing  the  number  of  trials  in  our  Monte-Carlo 
approximations.  Further,  let  U  =  (ui, . . . ,  ua)  and  V  be  lists  of  sequences;  and  let 
p  E  (0, 1)  be  a  significance  parameter.  For  each  u  E  U  and  each  v  e  V,  we  will 
approximate  sigv(u)  in  such  a  way  that  the  resultant  distribution  approximations 
of  all  such  pairings  can  also  be  used  to  approximate  the  distributions  necessary  for 
computing  gv(U,p). 

Let  7?  1 , ... . . ,  dy  €  n({7)  be  T  randomly  selected  lists  of  permutations.  For  each 
t  E  [T],  we  compute  and  store  the  correlation  matrix  Corr(V,  Unk).  This  list  of  corre¬ 
lation  matrices  will  allow  us  to  approximate  all  significance  values.  First,  to  compute 
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sigv(um),  note  that  the  list  corr(v,  Um’m  ),■■■,  corr(v,  Um’m)  can  be  extracted  from 
the  list  of  correlation  matrices;  hence,  we  approximate  sigv(um)  by  sigv(u,n)  =  ^ 
where  A  =  {t  e  [T]  :  \  corr(v,um)|  <  |  corr(v,  Um’m)|}.  Indeed,  this  process  produces 
an  approximation  Sig V(U)  of  the  significance  matrix  Sig V(U). 

To  approximate  gy(U )  using  the  list  of  computed  correlation  matrices,  we  note 
first  that  the  list  U7Tk  is  a  shuffling  not  only  of  the  list  U  but  also  of  the  list 
U17*'  for  each  t'  G  [ T ]  \  {t}.  As  such,  we  can  also  approximate  the  signifi¬ 
cance  matrices  Sigy(f/7ft)  by  Sigl/(177?t)  =  [sigv(um’m)]veV',me[a]  with  sigv(um’m)  = 
^  where  A  =  e  [T]  :  |  corr(v,  Um’m)|  <  |  corr(v,  With  these  matri¬ 

ces  in  hand,  the  approximation  of  gy(U,p)  is  straightforward:  Let  sig v(U',p)  = 
j(u,v)  e  U  x  V  |  sigy([//,p)| 

{t  e  [T\  I  Slgy(U^,p)  >  sigy(f/,p)|. 


,  from  which  we  define  gy(U,p)  =  ^  where  A  = 
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Chapter  5 

Experimental  set-up  and  data 
analysis 

The  tools  introduced  in  the  preceding  chapters  were  developed  for  use  in  the  anal¬ 
ysis  of  real-world  data.  The  lab  of  Eva  Pastalkova  at  the  Howard  Hughes  Medical 
Institute’s  Janclia  Research  Campus  graciously  provided  all  of  the  data  used  in  this 
document.  The  following  section  describes  the  experimental  setup  from  which  this 
data  was  collected  and  the  various  types  of  neuronal  sequences  that  are  considered  in 
our  analysis.  Section  5.2  then  describes  details  of  our  sequence  detection  techniques. 
Finally,  Section  5.3  uses  bias  matrices  to  perform  some  basic  analyses  of  the  data. 

5.1  The  experimental  setup  and  event  types 

Rats  were  trained  to  perform  a  basic  memory  task  in  the  U-shaped  track  pictured  in 
Figure  5.1.  In  short,  the  task  of  the  animal  can  be  described  as  follows:  To  receive  a 
reward,  the  animal  must  alternate  between  running  to  the  left  reward  port  and  the 
right  reward  port.  The  task  is  complicated  for  the  animal  by  the  delay  area,  which 
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Figure  5.1:  Alternation  memory  task.  The  three  main  areas  of  interest  for  this  task 
are  (1)  the  delay  area  and  running  wheel,  (2)  the  left  and  right  arms/ corridors,  and 
(3)  the  reward  area  at  the  end  of  each  arm.  The  dotted  lines  surrounding  the  delay 
area  represent  mechanically  movable  doors  that  are  used  to  hold  the  animals  in  the 
delay  area  until  the  wheel-run  portion  of  each  trial  has  been  completed. 

forces  the  animal  to  run  in  the  running  wheel  for  a  specified  amount  of  time  before  the 
doors  of  the  delay  area  open  and  allow  the  animal  to  proceed  toward  a  reward  port. 
Each  trial  of  the  experiment  consists  of  a  run  in  the  wheel,  a  visit  to  a  reward  port, 
and  a  return  to  the  delay  area.  By  requiring  the  animal  to  spend  time  running  in  the 
wheel  before  choosing  to  run  to  the  left  or  right,  the  animal  is  forced  to  remember 
the  arm  to  which  it  should  next  run  instead  of  being  allowed  to  run  immediately  from 
one  arm  to  the  other. 

In  addition  to  performing  this  task  under  normal  conditions,  each  animal  per¬ 
formed  the  task  after  the  drug  muscimol  had  been  injected  into  the  animal’s  medial 
septum.  This  injection  has  the  effect  of  shutting  down  the  medial  septum,  which  is 
the  area  of  the  brain  that  serves  as  the  primary  drive  of  the  theta  oscillations  that 
are  thought  to  be  an  integral  part  of  the  generation  of  place-field  sequences  in  the 
hippocampus  during  locomotion. 

During  performance  of  this  task,  the  rat’s  location  within  the  track  was  recorded. 
Further,  local-field  potentials  (LFPs)  were  recorded  at  various  depths  across  the  pyra¬ 
midal  layer  of  the  CAl  region  of  the  rat’s  hippocampus.  Neuronal  spiking  information 
was  then  extracted  from  these  LFPs  via  a  process  called  spike  sorting.  For  our  pur- 
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poses,  we  will  say  that  an  event  is  all  of  the  information  (LFPs,  spike  trains,  animal 
location,  etc.)  within  some  time  window.  The  sequence  corresponding  to  an  event  is 
the  list  of  all  spikes  that  occur  within  that  event.  The  various  event /sequence  types 
that  we  will  be  investigating  are  described  below. 

Wheel-run  events  are  behavioral  events  recorded  while  the  animal  is  running  in 
the  wheel  to  satisfy  the  delay  requirement.  As  such,  each  wheel-run  event 
is  recorded  over  a  period  of  approximately  10  seconds.  These  event  types  are 
generally  the  longest  in  duration  of  all  of  the  event  types  that  we  consider;  thus, 
the  corresponding  wheel-run  sequences  are  usually  the  longest  of  our  recorded 
sequence  types.  Because  the  animals  are  running  during  these  sequences,  the 
hippocampus  exhibits  a  prominent  theta  oscillation  under  normal  conditions. 

Arm-run  events  are  behavioral  events  recorded  while  an  animal  is  running  through 
either  arm  of  the  track  in  either  direction  between  the  delay  area  and  a  reward 
area.  The  duration  of  each  arm-run  event  is  the  length  of  time  thatit  takes 
the  animal  to  get  where  it  is  going,  usually  around  2  seconds.  Though  shorter 
than  wheel-run  sequences,  arm-run  sequences  are  still  usually  quite  long.  The 
locomotion  of  the  animal  is  again  accompanied  by  a  prominent  theta  oscilla¬ 
tion.  Because  the  animals  are  actually  moving  through  the  track  during  these 
sequences,  the  sequences  exhibit  place  fields  under  normal  conditions.  We  di¬ 
vide  these  sequences  into  four  subtypes  based  on  the  trajectory  of  the  animal; 
specifically,  the  animal  can  run  to  (outbound)  or  from  (inbound)  either  the  left 
or  right  reward  area. 

Sharp-wave  ripple  (SWR)  events  usually  occur  while  an  animal  is  not  in  mo¬ 
tion;  hence,  they  primarily  occur  in  the  reward  areas  and  in  the  delay  area 
outside  of  the  wheel,  though  we  placed  no  explicit  restrictions  on  the  location 
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or  movement  of  the  animal  in  our  detection  of  these  events.  As  described  in 
Section  1.1,  these  events  are  characterized  by  a  hyper-polarization  in  the  LFP 
across  the  pyramidal  layer  in  the  CAl  region  of  the  hippocampus,  which  gen¬ 
erally  lasts  from  around  50  to  250  milliseconds.  Given  the  shorter  timescale 
than  the  behavioral  events,  these  sequences  are  generally  much  shorter  than 
wheel-run  and  arm-run  sequences. 

Refined  SWR  events  are  the  bursts  of  neuronal  activity  that  accompany  SWRs. 
These  bursts  of  activity  are  not  always  perfectly  aligned  with  the  corresponding 
sharp  wave,  which  (as  we  will  see  in  the  next  section)  is  the  feature  we  use  to 
detect  the  SWR  events.  Given  that  our  analysis  is  performed  on  sequences  of 
neuronal  spikes,  it  is  important  that  we  strive  to  find  the  sequences  of  spikes 
that  most  accurately  represent  the  neuronal  activity  that  accompanies  the  SWR 
events,  regardless  of  whether  or  not  those  sequences  are  aligned  with  the  ac¬ 
companying  LFP  envelope.  This  refinement  process  results  in  the  discarding 
of  many  events  that  do  not  contain  sufficient  spiking  information.  Thus,  there 
are  fewer  refined  SWR  sequences  than  (unrefined)  SWR  sequences;  but  the 
sequences  that  remain  are  often  longer. 

5.2  SWR  detection  and  refinement 

Unlike  arm-run  and  wheel-run  events,  SWR  events  are  not  behavioral  in  nature. 
In  particular,  no  action  or  activity  of  an  animal  will  tell  us  when  a  SWR  event  is 
occurring.  As  such,  we  must  find  a  way  to  determine  when  these  events  are  occurring. 
To  do  this,  we  consider  one  prominent  feature  of  SWR  events:  the  sharp  wave.  During 
a  sharp  wave,  the  LFPs  across  the  pyramidal  layer  of  CAl  become  hyperpolarized, 
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resulting  in  the  characteristic  shape  that  has  already  been  observed  (see  Figure  1.2). 
Our  method  for  detecting  SWR  events  can  be  summarized  as  follows: 

1.  Select  two  LFP  channels,  a  lower  channel  (i.e.,  one  that  drops  in  value  during 
SWRs)  and  an  upper  channel  (i.e.,  one  that  increases  in  value  during  SWRs). 
In  lieu  of  an  automated  method  for  performing  this,  all  of  the  LFP  channels 
were  selected  by  hand.  The  selection  process  was  done  by  gaining  an  inuitive 
understanding  of  what  SWR  events  look  like  and  then  by  selecting  the  channels 
that  most-faithfully  represented  the  desired  LFP  separation. 

2.  Center  each  LFP  at  zero  by  computing  a  local  average  for  each.  This  is  necessary 
to  account  for  changes  in  the  LFPs  that  occur  over  a  longer  time  scale  than 
SWRs  occur  on.  To  do  this,  we  computed  the  average  output  for  each  channel 
over  the  interval  of  one  second  preceding  each  point  in  time;  we  then  subtracted 
the  resultant  local  averages  from  the  original  signals. 

3.  Create  a  single  sharp-wave  signal  by  subtracting  the  lower  centered-LFP  from 
the  upper  centered-LFP,  subtracting  the  overall  mean  of  the  resultant  signal, 
normalizing  by  the  standard  deviation  of  the  resultant  signal,  and  smoothing 
the  resultant  signal  using  a  gaussian  filter. 

4.  Threshold  the  sharp-wave  signal  to  determine  initial  estimates  for  SWR  event 
windows.  We  set  the  base-line  threshold  at  1.25  standard  deviations  of  the 
sharp-wave  signal. 

5.  Filter,  combine,  and  modify  the  initial  event  estimates  to  ensure  that  the  each 
even  satisfies  properties  characteristic  of  SWRs. 

•  Split  single  events  into  multiple  events.  Sometimes,  SWRs  happen  close 
enough  together  that  the  separation  in  the  LFP  from  the  first  event  has 
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not  ended  before  the  separation  from  the  second  event  has  begun.  As  a 
result,  the  detection  process  up  to  this  point  joins  those  multiple  events 
into  a  single  event.  This  process  of  splitting  is  done  by  finding  the  peaks  of 
the  second  derivative  of  the  smoothed  sharp-wave  signal;  if  a  large  enough 
peak  occurs  within  an  event,  then  the  event  is  split  at  that  point  in  time. 

•  Remove  events  that  do  not  last  for  some  minimum  amount  of  time  (25 
ms). 

•  Shorten  events  that  last  for  too  long.  It  is  sometimes  the  case  that  the 
minimum-required  LFP  separation  of  a  detected  event  can  persist  for 
longer  than  the  underlying  SWR  event.  To  account  for  this  while  not 
discarding  legitimate  SWRs,  we  enforce  a  maximum  SWR  duration  (250 
ms)  by  taking  the  event  in  a  250-ms  time  window  surrounding  the  highest 
point  of  LFP  separation  within  the  event. 

•  Remove  events  in  which  the  peak  of  the  sharp-wave  signal  is  not  large 
enough  (4  standard  deviations  of  the  sharp-wave  signal). 

•  Remove  events  that  do  not  contain  a  sudden  LFP  separation.  Without 
this  characteristic  feature,  the  event  is  not  a  SWR.  We  enforce  this  by 
ensuring  that  the  absolute  value  of  the  first  derivative  of  the  sharp-wave 
signal  passes  above  a  minimum  value. 

•  Ensure  that  enough  spikes  happen  close  enough  to  each  other  within  each 
event.  This  is  done  by  summing  up  gaussian  functions  centered  at  the 
spikes  within  each  event  and  ensuring  that  the  resultant  signal  passes  above 
a  minimum  value. 

The  parameters  used  in  this  detection  process  were  fine-tuned  by  hand  until  the 
resultant  events  were  largely  undeniably  SWRs  but  also  while  not  being  so  restrictive 
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(b)  Muscimol  condition 


(d)  Needs  refinement 


Figure  5.2:  SWR  events.  Panels  (a)  and  (b)  show  standard  examples  of  SWRs. 
Panel  (c)  shows  an  example  of  a  doublet,  two  SWR  events  that  occur  so  close  in  time 
as  to  be  joined  together.  Panel  (d)  shows  an  example  of  a  SWR  for  which  the  burst 
of  neural  activity  significantly  precedes  the  accompanying  sharp  wave.  Such  an  event 
requires  refinement. 


as  to  eliminate  many  good  candidates  for  SWR  events. 

Although  detection  based  on  the  sharp  wave  gives  us  a  good  starting  place  for 
SWR  detection,  a  detection  algorithm  based  solely  on  the  sharp  wave  is  insufficient 
for  our  needs.  While  the  bursts  of  neural  activity  in  a  SWR  are  (by  definition)  accom¬ 
panied  by  a  sharp  wave,  it  is  not  the  case  that  the  two  types  of  activity  are  perfectly 
synchronized  in  time.  See  Figure  5. 2d  for  an  example  of  this.  Since  our  analysis  is 
of  neuronal  sequences,  it  is  crucial  that  we  have  as  accurate  a  set  of  sequences  as 
possible.  Our  attempt  to  extract  the  events  corresponding  to  the  neuronal-activity 
bursts  of  SWR  events  consists  of  four  basic  steps: 


1.  Detect  the  intrinsic  sequences  of  a  recording.  These  are  formed  by  splitting  the 
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spike  train  of  a  recording  wherever  the  gap  between  successive  spikes  exceeds 
some  threshold  and  then  by  extracting  the  sequences  from  each  resultant  spike 
train. 

2.  Discard  all  intrinsic  sequences  that  do  not  overlap  with  any  SWR  event. 

3.  Join  together  those  intrinsic  sequences  that  overlap  with  a  common  SWR  event. 
The  idea  is  that  the  SWR  event  should  represent  a  single  pice  of  sequential 
information.  Any  gap  that  occurs  within  such  a  joining  of  intrinsic  sequences 
is  justified  as  being  within  a  single  sequence  by  arguing  that  other  neurons — 
neurons  that  were  not  detected — were  likely  spiking  within  that  interval. 

4.  Of  the  resulting  collection  of  sequences,  keep  only  those  that  meet  certain  re¬ 
quirements.  First,  we  impose  a  requirement  on  the  minimum  number  of  active 
neurons;  a  sequence  does  not  contain  much  useful  information  to  us  if,  for  in¬ 
stance,  it  contains  spikes  from  only  one  neuron.  Next,  we  impose  a  duration 
condition;  there  is  reason  to  believe  that  SWRs  have  minimum  and  maximum 
duration.  Lastly,  we  require  that  the  sequence  is  isolated  in  time  from  its  near¬ 
est  intrinsic  sequences;  this  is  an  attempt  to  ensure  that  the  activity  captured 
by  the  sequence  is  a  burst  of  activity  accompanying  a  SWR  and  not  due  to 
increased  neural  activity  in  the  area  for  some  other  reason. 

This  process  generally  results  in  significantly  fewer  events  than  the  sharp-wave  detec¬ 
tion  process. 

5.3  Sequence  comparison  in  in-vivo  hippocampus 

It  is  believed  that  theta  oscillations  are  necessary  for  the  generation  of  hippocam¬ 
pal  sequences.  Analyses  suggesting  this  result  [10]  compare  observed  hippocampal 
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sequences  under  normal  and  theta-inhibited  (i.e. ,  muscimol)  conditions  to  neuronal 
templates  generated,  for  instance,  by  considering  the  order  of  place  fields  in  a  track 
under  normal  conditions.  The  introduction  of  the  bias  matrix  allows  for  the  direct 
comparison  of  observed  sequences  to  each  other.  In  this  section,  we  use  the  techniques 
of  Chapter  4  to  perform  similarity  analyses  on  the  hippocampal  sequences  described 
in  Section  5.1.  In  particular,  we  will  attempt  to  determine  whether  the  sequences 
generated  under  the  influence  of  muscimol  are  different  from  those  generated  under 
normal  conditions.  For  reference,  brief  descriptions  of  the  sequence  types  are  given 
in  Table  5.1. 


Table  5.1:  Hippocampal  sequence  types. 


Type 

Subtypes 

Description 

W  heel-run 

Behavioral  sequences  generated  during  the 
enforced  delay  period  while  running  in 
the  wheel 

Arm-run 

Left  /  outbound 
Left  /  inbound 
Right  /  outbound 
Right  /  inbound 

Behavioral  sequences  generated  as  an  animal 
traverses  a  path  between  the  delay  area 
and  a  reward  area 

Refined  SWR 

JN  on- behavioral  sequences  from  SWRs 

Question  5.1.  Are  hippocampal  sequences  the  same  after  elimation  of  theta  oscilla¬ 
tion? 

In  the  end,  our  analysis  suggests  that  eliminating  theta  oscillation  eliminates  (at 
least  some  of)  the  structure  of  behavioral  hippocampal  sequences.  We  approach  this 
question  by  setting  reference  sets  of  sequences  to  which  we  can  compare  normal- 
condition  sequences  and  muscimol-condition  sequences.  Since  the  question  we  are 
trying  to  answer  is  about  whether  there  is  a  change  from  the  normal  behavior,  these 
reference  sets  should  consist  of  normal-condition  sequences.  For  instance,  when  com¬ 
paring  normal-condition  wheel-run  sequences  to  muscimol-condition  wheel-run  se¬ 
quences,  the  reference  set  will  be  the  collection  of  normal-condition  wheel-run  se- 
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quences.  The  basic  tools  used  in  this  section  are  the  correlation  and  significance 
matrices  introduced  in  Section  4.3.  Figure  5.3  provides  an  example  of  the  correlation 
and  significance  matrices  for  each  sequence  type.  The  examples  in  this  figure  illustrate 
how  the  correlation  matrix  can  be  a  useful  tool  for  observing  structure  in  collections 
sequences.  In  particular,  notice  the  difference  in  the  matrix  structure  between  wheel, 
arm,  and  SWR  correlations.  Where  the  wheel-run  correlation  matrix  displays  that  all 
correlations  are  positive,  the  arm-run  correlation  matrix  shows  positive  and  negative 
correlations  existing  in  a  highly  structured  manner  (switching  sign  as  we  go  between 
comparing  subtypes  of  these  sequences);  the  SWR  correlation  matrix,  on  the  other 
hand,  shows  that  positive  and  negative  correlations  exist  in  a  much  less  structured 
way. 

The  differences  in  structure  between  the  correlation  matrices  shown  in  Figure  5.3 
can  be  observed  over  all  data  sets  in  Figure  5.4,  which  shows  histograms  of  the 
correlation  values  for  the  different  types.  For  instance,  looking  at  the  distributions 
of  correlation  values  for  arm-run  sequences,  the  normal-condition  correlations  tend 
to  be  more  positive  than  the  muscimol-condition  correlations.  This  observation  is 
in  line  with  the  intended  purpose  of  the  muscimol  injections  to  eliminate  the  theta 
oscillation  that  is  believed  to  aid  in  the  formation  of  place-field  sequences. 

In  addition  to  looking  for  changes  in  the  distributions  of  correlation  values  after 
injection  of  muscimol,  we  also  investigate  the  change  in  proportion  of  correlations 
that  are  significant  after  injection  of  muscimol,  as  shown  in  Figure  5.5.  For  behav¬ 
ioral  events,  muscimol  is  accompanied  by  a  reduction  in  the  proportion  of  correlations 
that  are  significant.  The  reduction  is  somewhat  modest  for  arm-run  sequences  when 
compared  to  the  reduction  for  wheel-run  sequences.  This  suggests  that  theta  oscilla¬ 
tion  is  more  critical  for  the  formation  of  normal-condition  wheel-run  sequences  than 
for  the  formation  of  normal-condition  arm-run  sequences. 
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In  conclusion,  our  analysis  demonstrates  via  bias-matrix  correlations  that  behav¬ 
ioral  hippocampal  sequences  lose  some  of  their  structure  when  theta  oscillation  is 
removed  by  the  injection  of  musimol  into  an  animal’s  medial  septum. 
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(a)  Wheel  correlation 


(c)  Arm  correlation 


(e)  SWR  correlation 


(b)  Wheel  significance 
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(d)  Arm  significance 


(f)  SWR  significance 


Figure  5.3:  Correlation  and  significance  matrices.  Red  entries  indicate  positive  corre¬ 
lation,  blue  entries  indicate  negative  correlation,  and  white  entries  indicate  incompa¬ 
rable  sequences;  in  significance  matrices,  gray  entries  indicate  insignificant  correlation 
values.  Panels  (a)  and  (b)  show  that  all  wheel  sequences  are  positively  correlated  to 
each  other  and  that  a  majority  of  the  correlations  are  significant.  Panels  (c)  and  (d) 
show  matrices  for  arm-run  sequences;  events  are  sorted  by  their  subtypes:  left/out- 
bound,  left/inbound,  right/outbound,  left/inbound.  Lines  delineate  subtypes.  The 
checkering  pattern  indicates  postitive  correlation  within  a  subtype  and  negative  cor¬ 
relations  between  inbound  and  outbound  events.  Few  correlations  between  subtypes 
are  significant.  Panels  (e)  and  (f)  show  the  sparsity  of  correlation  matrices  for  SWR 
sequences;  additionally,  few  of  the  comparable  SWR  events  are  significantly  corre¬ 
lated.  The  all-red  diagonals  indicate  that  each  event  is  positively  and  significantly 
correlated  to  itself.  All  matrices  were  generated  from  the  same  recording. 
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(a)  Wheel  correlation  (b)  Arm  correlation 


(c)  SWR  correlation 


Figure  5.4:  Histograms  of  correlation  values.  For  each  type  of  sequence,  the  corre¬ 
sponding  plot  above  shows  the  distribution  of  correlation  values  for  normal-condition 
and  muscimol-condition  correlation  values  across  all  data  sets.  In  panel  (a),  the 
wheel-run  histograms  show  the  greater  tendency  of  normal-condition  sequences  to 
be  positively  correlated  (88.75%  positive)  than  their  muscimol  counterparts  (67.86% 
positive).  Panel  (b)  illustrates  a  similar  phenomenon  for  normal-condition  arm-run 
sequences  to  exhibit  stonger  positive  correlations  than  their  muscimol  counterparts. 
Panel  (c)  shows  a  much  less  pronounced  difference  when  considering  SWR  sequences, 
though  the  two-sample  Kolmogorov-Smirnov  test  indicates  that  the  distributions  are 
indeed  different  (p- value  of  0.0097). 
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(a)  Wheel  (b)  Arm 


(c)  SWR 


Figure  5.5:  Proportion  of  correlations  that  are  significant.  For  each  recording  and 
each  correlation/significance  matrix,  we  consider  the  proportion  of  significance  values 
that  are  below  some  threshold  (0.05);  this  process  results  in  two  values,  one  for 
the  normal-condition  sequences  and  one  for  the  muscimol-condition  sequences,  the 
horizontal  and  vertical  axes,  respectively,  of  the  plots  above.  Panels  (a)  and  (b)  show 
that  injection  of  muscimol  is  accompanied  by  a  systematic  drop  in  the  proportion 
significant  correlations  for  both  sets  of  behavioral  sequences.  This  universal  drop  in 
significance  is  not  observed  when  considering  SWR  sequences  in  panel  (c). 
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Appendix  A 


README  for  codebase 


This  code  is  designed  to  provide  tools  for  working  with  neuronal  sequences.  The 
project  started  originally  as  a  set  of  functions  to  detect  sharp-wave  ripples  in  the 
hippocampus.  The  code  breaks  down  into  three  basic  pieces: 

•  The  sequences  folder:  At  the  core  of  this  project,  the  aim  was  to  analyze 
neuronal  sequences.  Our  concept  of  a  sequence  is  simply  a  list  of  spikes.  Be¬ 
cause  we  discard  timing  information  from  the  spike  trains,  the  techniques  imple¬ 
mented  here  are  agnostic  of  the  timescale  on  which  such  a  sequence  was  actually 
recorded. 

•  The  Event  class:  An  Event  is  a  convenience  class  that  was  designed  (1)  to  store 
a  sequence  and  additional  information  about  its  origin  in  a  recording  and  (2) 
to  bundle  together  functions  for  working  with  sequences. 

•  The  NeuralData  class:  A  NeuralData  object  is  our  interface  to  the  raw  data 
from  a  neural  recording.  This  class  provides  functions  for  retrieving  recording 
information,  extracting  LFPs,  and  detecting  various  event  types,  among  some 
other  convenience  functions. 
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The  sections  below  describe  each  of  these  pieces  in  more  detail  along  with  some  basic 
examples  of  how  to  use  this  project.  This  project  is  dependent  on  some  additional 
functionality  found  in  the  matlab-incremented  project. 

A.l  Sequences 

Since  a  sequence  as  we  have  it  defined  is  just  a  list  of  spikes  (i.e.,  neurons),  each 
sequence  represented  in  this  project  as  a  vector.  As  such,  many  MATLAB  functions 
for  vectors  can  come  in  handy.  For  instance,  the  collection  of  neurons  that  are  active 
in  a  sequence  s  can  be  found  with  unique  (s). 

1  s  =  [1,  2,  1,  3,  2,  2,  4,  3,  3,  4] 

2  unique (s) 


A  few  sequence-specific  functions  have  been  written  where  there  was  no  equivalent 
for  vectors.  The  spike-count  vector  can  be  computed  with  spikecount,  and  the  index 
sets  where  neurons  spike  can  be  extracted  with  spikesets. 

1  spikecount (s) 

2  spikesets (s) 


To  visualize  s,  create  a  spike-raster  plot. 


1  spikeraster (s) 


The  center-of-mass  ordering  of  a  sequence  s  can  be  computed  with  sortneurons  (s) . 


1  sortneurons (s) 
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The  bias-count  and  skew-bias  matrices  of  s  can  be  computed  with  biascount  (s)  and 
skewbias(s),  respectively.  Both  of  these  functions  return  a  sparse  matrix. 


1  full (biascount (s) ) 

2  full (skewbias (s) ) 


The  spikes  of  a  sequence  can  be  permuted  to  be  in  a  random  order  using 
shuffle. 


1  u  =  shuffle (s) 


To  compute  the  correlation  between  sequences,  use  correlation. 


1  correlation(s ,  u) 


For  a  list  LI  of  sequences  (i.e. ,  a  vector  cell  array),  use  activity  (LI)  to  return  a 
matrix  in  which  row  i  is  an  indicator  vector  for  the  set  of  neurons  that  are  active  in 

L{  i  }  - 


1  randseq  =  @(~)  randi(5,  1,  3); 

2  LI  =  arrayfcn (randseq,  zeros(3,  1)); 

3  Ll{ : 1 

4  activity(Ll) 


With  a  second  list  L2  of  sequences,  the  number  of  neurons  that  each  pair  of  sequences 
has  in  common  can  be  found  with  coactivity  (LI ,  L2). 


1  L2  =  arrayfcn (randseq,  zeros(3,  1)); 

2  L2{ : 1 

3  coactivity (LI ,  L2) 
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A.  2  Events 

When  dealing  with  sequences  in  real  data,  it  is  often  necessary  for  purposes  of  analysis 
to  know  more  about  each  sequence.  In  concept,  an  event  is  all  of  the  information  about 
a  recording  that  occurs  within  some  time  window;  as  such,  the  Event  object  class 
stores  this  time  window.  However,  extracting  information  from  a  NeuralData  object 
can  be  time-consuming;  so  this  class  also  stores  some  additional  information.  The 
members  of  this  class  are  listed  below: 

window  This  parameter  stores  the  time  window  during  which  an  event  occurs.  It  is 
stored  as  a  vector  of  the  form  [start,  end] .  For  consistency  regardless  of  the 
sampling  rate  of  a  recording,  this  is  stored  with  units  of  seconds. 

spikes  This  parameter  stores  the  list  of  spikes  that  occur  during  an  event.  That  is, 
spikes  is  the  neuronal  sequence  contained  within  an  Event  object. 

times  This  parameter  stores  the  times  corresponding  to  the  event’s  spikes. 

type  This  parameter  allows  the  user  to  label  each  event  as  some  specific  type.  A 
type  can  be  an  arbitrary  string. 

Though  an  Event  can  be  created  manually,  it  is  often  most  useful  in  that  several 
methods  of  the  NeuralData  class  return  (lists  of)  events.  Because  the  Event  class 
was  designed  primarily  as  a  wrapper  for  sequences,  much  of  the  functionality  possible 
with  sequences  can  be  done  using  events.  To  see  the  available  methods,  type  methods 
Event  at  a  MATLAB  prompt. 
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A. 3  Interfacing  with  real  data 

The  code  in  the  NeuralData  class  was  originally  designed  to  provide  functionality 
for  detecting  sharp-wave  ripples  in  LFP  signals.  For  the  sake  of  example,  sup¬ 
pose  that  the  directory  Alll-20150701  contains  the  files  Alll-20150701.dat  and 
Alll-20150701_BehavElecDataLFP  .mat.  Also,  assume  that  cache  is  a  directory.  We 
can  create  a  NeuralData  object  by  specifying  the  data  and  cache  directories. 

1  nd  =  NeuralData( ’ Alll-20150701 ’ ,  ’cache’); 


We  can  now  extract  various  kinds  of  information  from  the  object  nd.  To  detect  SWR 
events,  three  channels  need  to  be  selected  by  hand  using  an  external  tool.  Once  the 
channels  have  been  selected,  load  them  with  loadChannels. 

1  loadChannels (nd) 


The  SWR  events  can  now  be  detected  with  detectRipples. 


1  cellRipples  =  detectRipples (nd) ; 


The  output  of  this  method  is  a  cell  array  of  Event  objects.  These  events  can  be 
visualized  with  browseEvents. 


1  browseEvents (nd,  cellRipples) 


The  locations  of  the  animal  in  the  track  at  a  particular  list  of  times  can  be  retrieved 
with  the  methods  getLocationsAtTimes.  For  instance,  we  might  want  to  know  where 
ripples  occurred  in  the  track.  To  do  this,  we  will  need  a  single  time  to  represent  each 
event.  Let’s  choose  the  starting  time  of  the  events. 
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1  vTimes  =  cellfun(@startTime ,  cellRipples) ; 

2  mtxLocs  =  getLocationsAtTimes (nd,  vTimes); 


We  can  now  visualize  where  the  ripples  occur  on  the  track. 


1  vWinX  =  minmax (mtxLocs ( : ,  1)); 

2  vWinY  =  minmax (mtxLocs( : ,  2)); 

3  dStd  =  0.1  *  min(diff (vWinX) ,  diff (vWinY) ) ; 

4  nResolution  =  1000; 

5 

6  imagesc (psth2d (mtxLocs ,  vWinX,  vWinY,  dStd,  nResolution)) 


Many  other  NeuralData  methods  exist  for  extraction  of  recording  information  and 
event  information. 
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Appendix  B 


Supplementary  figures 
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(a)  Normal  condition 
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(b)  Muscimol  condition 
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(c)  Increase  of  occurrence 


Figure  B.l:  SWR  occurrence.  Panel  (a)  shows  that  under  normal  conditions,  SWR 
events  occur  almost  exclusively  at  the  reward  areas,  though  some  small  number  occur 
within  the  delay  area.  Panel  (b)  shows  that  injection  of  muscimol  is  accompanied  by 
an  increased  number  of  SWRs  occurring  within  the  delay  area.  Panel  (c)  shows  the 
ratio  of  the  number  of  muscimol  events  to  the  number  of  normal-condition  events  on 
a  per-recording  basis. 
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Figure  B.2:  Sequence  length.  The  left  column  shows  the  relationship  between  the 
duration  of  an  event  and  the  corresponding  sequence’s  length.  The  right  column 
shows  the  relationship  between  the  total  number  of  neurons  that  are  active  within  an 
event  and  the  corresponding  sequence’s  length.  Each  plot  in  the  right  column  shows 
that  a  linear  dependence  exists  between  the  number  of  active  cells  in  an  event  and 
the  total  length  of  the  sequence. 
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(a)  Wheel-run.  Median:  13.30  / 
12.94;  KS:  0.0046 


(b)  Arm-run.  Median:  1.84  /  2.16; 
KS:  1. 


(c)  Left  /  outbound.  Median:  1.66 
/  1.96;  KS:  0. 


(d)  Left  /  inbound.  Median:  2.25  / 
2.74;  KS:  1. 


(e)  Right  /  outbound.  Median: 
1.64  /  1.97;  KS:  8. 


(f)  Right  /  inbound.  Median:  2.65 
/  2.70;  KS:  0. 


(g)  Refined  SWR.  Median:  0.084  / 
0.082;  KS:  2.99 


Figure  B.3:  Event  duration.  For  each  event  type,  the  distribution  of  event  durations 
is  shown  under  normal  (solid  line)  and  muscimol  (dashed  line)  conditions.  The  me¬ 
dian  of  each  distribution  is  shown  as  normal  /  median.  In  each  case,  the  two-sample 
Kolmogorov-Smirnov  test  (KS)  tells  us  the  p-value  with  with  we  reject  the  null  hy¬ 
pothesis  that  the  two  distributions  are  the  same.  For  each  behavioral  event  type 
(all  but  SWR),  the  injection  of  muscimol  tends  to  create  slightly  longer  events.  In 
contrast,  muscimol  tends  to  create  slightly  shorter  SWR  events. 
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Figure  B.4:  Number  of  active  neurons.  The  vertical  axis  shows  the  ratio  of  the  median 
number  of  active  neurons  in  a  muscimol  event  to  the  median  number  of  active  neurons 
in  an  event  under  normal  conditions.  After  injection  of  muscimol,  the  median  number 
of  active  cells  in  wheel-run  sequences  tends  to  decrease  slightly,  whereas  the  median 
number  of  The  median  number  of  active  neurons  changes  after.  No  change  is  observed 
in  the  number  of  active  neurons  in  arm-run  sequences. 
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